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Assistant Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

We are transmitting herewith the attached: 

^ Transmittal sheet, in duplicate, containing Certificate Of Mailing Under 37 CFR 1.10. 

S Verified statement to establish small entity status - by The Regents of the University of California. 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Applicant: Norbert O. Reich et al. Docket No.: 30794.30USWO 

Filed: February 3, 2000 

Title: MODULATORS OF DNA CYTOSINE-5 METHYLTRANSFERASE AND 

METHODS FOR USE THEREOF 



CERTIFICATE OF MAIUNG OR TRANSMISSION UNDER 37 CFR 1.10 

'Express Mail' mailing label number: EL307943991LK 

Date of Deposit: February 3, 2000 
I hereby certify that this correspondence is being deposited with the United States Postal Service 
jExpress Mail Post Office To Address' service under 37 CFR 1.10 and is addressed to: Assistant 
Commissioner for Patents, Washington, D.G 20231. 

By_; 

Name: Darlene 



D and is addressed to: Assistant 
me: Darlene Ross 7 



PRELIMINARY AMENDMF,NT 

Assistant Q>mmissioner for Patents 
Washington, D.C. 20231 
Dear Sir: 

In connection with the above-identified application filed herewith, please enter the 
following preliminary arnendment: 

IN THE CLAIMS 

Please cancel claims 1-20 in the WIPO application and insert new claims 1-25 as 

follows: 

1 . A synthetic oligonucleodde comprising a 05 methylcytosine and which 
recognizes and binds an allosteric site on DNAcytosine meth^^tmnsferase (DCMTase) 
thereby modulating DCMTase activity associated with the allosteric site. 

2. The synthetic oligonucleotide of claim 1, wherein the modulating comprises 
inhibition. 

3. The synthetic oUgonucleotide of claim 1, wherein the modulating comprises 
activation. 

4. The synthetic oligonucleotide of claim 1, wherein the C5 meth}4cytosine is 
present as a 5mCpG dinucleotide. 
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5. The synthetic oligonucleotide of claim 1, wherein the DCMTase is from a 
mammal, bird, fish, amphibian, reptile, insect, plant or fungus. 

6. The synthetic oligonucleotide of claim 5, wherein the mammal is selected 
from the group consisting of mouse and hviman. 

7. The synthetic oligonucleotide of claim 1 having an inhibition constant of 
not greater than 1000 nM. 

8. The synthetic oligonucleotide of claim 7 having an inhibition constant of 
not greater than 200 nM. 

9. The synthetic oligonucleotide of claim 8 havir^ an inhibition constant of 
not greater than 20 nM. 

10. The synthetic oligonucleotide of claim 1 comprising a nucleotide sequence 
as shown in F^ure IB and designated GGbox b^^ (SEQ ID NO:10), GCbox p^^ (SEQ 
ID NO:10), GCbox c^^(SEQ ID NO:13), GCbox d^T(SEQ ID NO:14), GCbox e™^ 
(SEQ ID NO:15), or CRE a^'r(SEQ ID NO:ll). 

11. A method of inhibiting meth)4ation of DNA comprising contacting a 
DCMTase with a synthetic inhibitor molecule so as to form an enzyme/synthetic inhibitor 
molecule complex in the presence of the DNA, wherein the synthetic inhibitor molecule 
comprises a C5 meth)4cytosine which recognizes and binds an allosteric site on DCMTase, 
thereby inhibiting DNA meth}dtransferase activity. 

12. A method of inhibiting proliferation of cancer cells comprising 
administering to a subject a syndietic inhibitor molecule which recognizes and binds an 
allosteric site on DCMTase thereby resulting in an enzyme/synthetic inhibitor molecule 
complex, the presence of the complex inhibiting DCMTase- mediated meth}^tion of DNA, 
thereby inhibiting proliferation of the cancer cells. 

13 . The method of claim 12, wherein the cancer cell is from lung, bieast, 
prostate, pancreas or colon. 

14. The method of claim 1 1 , wherein the synthetic inhibitor molecule is a 
synthetic oligonucleotide comprising a C5 meth}icytosine and which recognizes and binds 
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an aEosteric site on DNAcytosine methyltransferase (DCMTase) thereby modulating 
DCMTase activity associated with the allosteric site. 



15. The method of claim 12, wherein the subject is a human. 

16. The method of claim 12, wherein the subject is an animal. 

17. The method of claim 16, wherein the animal is porcine, piscine, avian, 
feline, equine, bovine, ovine, caprine or canine. 

18. A method of identifybg a molecule which recognizes and binds an allosteric 
site on DCMTase comprisii^: 

(a) contacting a molecule with DCMTase in the presence of DNA and 
AdoMet; 

(b) 

measuring DCMTase activity, an increase or decrease in DCMTase activity 
being indicative of a modulator of DCMTase; and 

(c) determining whether the modulation of DCMTase activity is via binding an 
allosteric site on DCMTase. 

19. The method of claim 18, wherein the modulator is an inhibitor. 



20. 



The method of claim 18, wherein DCMTase activity is measured using a 



steady-state assay. 

2 1 . The method of claim 12, wherein the synthetic inhibitor moleciale 
comprises a C5 methylcytosine. 

22. The method of claim 12, wherein the synthetic inhibitor molecule is a 
synthetic oligonucleotide comprising a C5 meth)icytosine and which recognizes and binds 
an allosteric site on DNAcytosine methyltransferase (DCMTase) thereby modulating 
DCMTase activity associated with the allosteric site. 

23. The method of claim 14, wherein the subject is a human. 

24. The method of claim 14, wherein the subject is an animal. 
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25. The method of claim 24, wherein the animal is porcine, piscine, avian, 
feline, equine, bovine, ovine, caprine or canine. 



REMARKS 



The above preliminary amendment is made to introduce the claims filed under 
per Article 34 in the WIPO application, and to remove multiple dependencies. 

Applicant respectfully requests that the preliminary amendment described herein 
be entered in to the record prior to calculation of the filing fee and prior to examination 
and consideration of the above- identified application. 

If a telephone conference would be helpful in resolving any issues concerning this 



communication, please contact Applicant's primary attomey-of- record, Karen S. Canady at 
(310) 642-4148. 



Respectfully submitted, 



Norbert O. Reich et al. 



By their attorneys, 



GATES & COOPER 



6701 Center Drive West, Suite 1050 
Los Angeles, California 90045 
(310) 641-8797 



Date: February 3, 2000 




Karen S. Canady 
Reg. No.: 39,927 



KSC/dr 
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SMALL BUSINESS 

STAra,MENT (DECLARATION) CLAIMNG SMALL ENTITY 
* ■ (3/ CF.R L9(f) AND L27(c)) - SMALL BUSINESS CONCERN 

I hereby declare that I am: 

^ an official of the small business concern empowered to act on behalf of the concern identified below. 

NAME OF ODNCERN: EpiGenX Pharmaceutical 

ADDRESS OF CONCERN: 2124 Bath Street 

Santa Barbara, California 93105 

I hereby declare that the above- identified small business concern qualifies as a small business as defined in 13 CF.R, 
121.801-805, and reproduced in 37 C.F.R 1.9(d) for purposes of paying reduced fees under Section 41(a) and (b) of Title 35, 
United States Code, in that the number of employees of the concern, including those of its affiliates, does not exceed 500 
persons. For purposes of this statement, (1) the number of employees of the concern is the average over the previous fiscal 
year of the concern of the persons employed on a full-time, part-time or temporary basis during each of the pay periods of the 
fiscal j'ear, and (2) concerns arc affiliates of each other when either, directly or indirectly, one concern controls or has the pov,-cr 
to control the other, or a third party or parties controls or has the power to control both. 

I hereby declare that rights under contract or law have been conveyed to and remain with the small business concern 
identified above with regard to the invention, entitled: MODULATORS OF DNA CYTOSINE-5 

METHYLTRANSFERASE AND METHODS FOR USE THF.RF.OF by inventor(s) Norbert O. Reich and James Flynn 
described in; 

E International Application No. PCr/US98/12351 filed in the United States Receiving Office on June 12, 1998. 

If the rights held by the above- identified small business concern are not exclusive, each individual, concern or 
organization having rights to the invention listed below^''' and no rights to the invention are held by any person, other than the 
inventor, who could not qualify as an independent inventor under 37b GF.R l,9(c) or by any concern which woidd not qualify 
as a small business concern under 37 GFJl. 1.9(d) or a nonprofit organization under 37 C.F.R 1.9(e). *NOTE: Separate 
verified statements are required from each named person, concern or organization having rights to the invention averring to 
their stams as small entities. (37 C.F.R 1.27) ^ 

NAME The Regents of the University of California 

ADDRESS nil Franklin Street. 12^^ Floor, Oakland California 94607-5200 

□ INDIVIDUAL □ SMALL BUSINESS ^ NONPROFIT ORGANIZATION 

NAME 

ADDRESS 

□ INDIVIDUAL □ SMALL BUSINESS □ NONPROFIT ORGANIZATION 

I acknowledge the duty to file, in this application or patent, notification of any change in status resulting in loss of 
entidement to small entity status prior to paying, or at the time of paying, the earliest of the issue fee or any maintenance fee due 
after the date on which status as small entity is no longer appropriate. (37 C.F.R 1.28(b)) 

I hereby declare that all statements made herein of my own knowledge are true and that all statements made on 
information and belief are believed to be true; and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both under Section 1001 of Title 18 of the United 
States Code, and that such willful false statements may jeopardize the validity of the application, any patent issuing thereof, or 
any patent to which this verified statement is directed. 

NAME: David L. Cluck 

TITLE: President for Business 



DATE: f7 ^p.iMi'TA2Ml^ 



ADDRESS: 2124 Badi Street 

Santa Barbara, California 93105 

SIGNATURE: ^^"^X-^J, Q&uA 
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NONPROFIT ORGANIZATION 
VERIFIED STATEMENT pECLARATTON) CLAIMING SMALL ENTITY STATUS 
(37 CF.R 1.9(e) AND L27(d.)) - NONPROFIT ORGANIZATION 

I hereby declare that I am an official empowered to act on behalf of the nonprofit organization identified below: 

NAME OF ORGANIZATION: The Regents of the Univereity of California 

ADDRESS OF 1111 Franklin Street, 12'^ Floor 

ORGANIZATION: Oakland, California 94607 

TYPE OF NONPROFIT ORGANIZATION: * . |j 

a) S UNIVERSITY OR OTHER INSTITimON OF HIGHER EDUCATION 

b) g| TAX EXEMPT UNDER INTERNAL REVENUE SERVICE CODE (26 U.S.C. 501(a) and 501(c)(3)) 

c) □ NONPROFIT SaENTlFIC OR EDUCATIONAL UNDER STATUTE OF STATE OF THE UNITED 

STATES OF AMERICA 

(T^Alvffi OF STATE ) 

(QTATION OF STATUTE_ ) 

d) □ WOULD QUALIFY AS TAX EXEMPT UNDER INTERNAL REVENUE SERVICE CODE (26 U.S.C 

501(a) and 501(c)(3)) IF LOCATED IN THE UNITED STATES OF AMERICA 

e) □ WOULD QUALIFY AS NONPROFIT SQENTIFIC OR EDUCATIONAL UNDER STATUTE OF 

STATE OF THE UNITED STATES OF AMERICA IF LOCATED IN THE UNITED STATES OF 
AMERICA 

(NAME OF STATE ) 

(NAME OF STATUTE ) 

I hereby declare that the nonprofit organization identified above qualifies as a nonprofit organization as defined 
in 37 CF.R 1.9(e) for purposes of paying reduced fees under Section 41(a) and (b) of Tide 35, United States Code, in 
regard to the Invention, entitled: MODULATORS OF DNACYTOSINE-5 METHYLTRANSFERASE AND 
METHODS FOR USE THEREOF by inventor(s) Norbert O. Reich and James Flynn described in: 

K International Application No. PCT/US98/12351 filed in the United States Receiving Office on 
June 12, 1998. 

I hereby declare that rights under contract or law have been conveyed to and remain with the nonprofit 
organization with regard to the above-identified invention. 

If the rights held by the nonprofit organization are not excltisive, each individual, concern or organization having 
rights to the invention Ksted below=^ and no rights to the mvention are held by any person, other than the inventor, who 
could not qualify as an independent mventor under 37b CF.R 1.9(c) or by any concern which would not qualify as a 
small business concern under 37 CJ.R 1.9(d) or a nonprofit oi^anization under 37 C.F.R 1.9(e). *NOTE; Separate 
verified statements are required from each named person, concern or oi^anization having lighis to the invention 
averring to their status as small entities. (37 C,F.R 1.27) 

NAME EpiGenX Pharmaceuticals 

ADDRESS 2124 Bath Street, Santa Barbara, California 93105 

□ INDIVIDUAL S SMALL BUSINESS □ NONPROFIT ORGANIZATION 

NAME 
ADDRESS 

□ INDIVIDUAL □ SMALL BUSINESS □ NONPROFIT ORGANIZATION 

I acknowledge the duty to file, in this application or patent, notification of any change in status resulting in loss of 
entidement to small entity status prior to paying, or at the time of paying, the earHest of die issue fee or any maintenance 
fee due after the date on which status as small entity is no longer appropriate. (37 CF.R 1.28(b)) 

I hereby declare that all statements made herein of my own knowledge are true and that all statements made on 
Information and belief are believed to be true; and further that these statements were made with the knovdedge that 
willful false statements and the like so made are punishable by fine or imprisonment, or both under Section 1001 of Title 



18 of the'United States Code, and that such willful false statements may jeopardize the vaKdity of the application, any 
patent issuing thereof, or any patent to which this verified statement is directed. 

NAME; Linda S. Stevenson 

TITLE: Principal Prosecution Analyst 

ADDRESS: 1 11 1 Franklin Street, 5'^ Floor 

Oakland, California 94607-5200 



SIGNATURE: 
DATE: 
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MODULATORS OF DNA CYTOSINE-5 METHYLTRANSFERASE 
AND METHODS FOR USE THEREOF 

This application is based on United States provisional patent application serial number 
5 60/057,41 1 , filed August 29, 1 997, the entire contents of which are hereby incorporated 
by reference into this application. Throughout this application various publications are 
referenced. The disclosures of these publications in their entireties are hereby 
incorporated by reference into this application in order to more fully describe the state of 
the art to winch this invention pertains. 

10 This invention was made with Government support under Grant No, GM46333, 

awarded by the National Institutes of Health to Norbert O. Reich. The Government has 
certain rights in this invention. 

BACKGROUND OF THE INVENTION 

In eukaryotic organisms, DNA methylation is catalyzed by anS'-adenosyl-L- 
methionine (AdoMet)' -dependent DNA cytosine-C^ methyltransferase (DCMTase, 
EC 2.1.1.37). Methyl group transfer to the cytosine-C^ position occurs predominately 
within the cytosyl-guanosyl (CpG) context (Boyes, J., & Bird, A.P., 1991, DNA 
methylation inhibits transcription indirectly via a methyl-CpG binding protein. Cell 
64:1 123-1 134). The genomic distribution of 5-methylcytosine (5-inC) dynamically 
changes throughout ontogeny (Razin, A., 8c Riggs, A.D., 1980, DNA methylation and 
gene function. Science 210:604-609; Kafri, T. et al., 1992, Developmental pattern of 
gene-specific DNA methylation in the mouse embryo and germ line, Genes and Dev. 
6:705-714). The methylation state of a gene specifically affects transcription. 



25 DCMTase is involved in mammalian development by way of an undefined process 
that can lead to gene regulation (reviewed in Jost, J.P., & Saluz, H.P., 1993, DNA 
Methylation: Molecular Biology and Biological Significance, Birkhauser Verlag, 
Basel). Proper DCMTase function is essential for viable development and for normal 
cellular activity (Li, E. et al., 1992, Targeted mutation of the DNA methyltransferase 

30 gene resuhs in embyonic lethality. Cell 69:91 5-926). 

Cytosine methylation is the predominant epigenetic event in the modification of 
eukaryotic DNA. To date only a single DCMTase has been identified in several 
metazoan organisms (Yoder, J. A., et al., 1996, New 5' regions of the murine and 
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human genes for DNA cytosine-5 methyltransferase, J. Biol. Chem. 271:31092- 
3 1097). The function most often identified with cytosine methylation (5-'"C) in 
higher eukaryotes is the regulation of transcription (Jost, J.P., & Saluz, H.P., 1993, 
DNA Methylation: Molecular Biology and Biological Significance, Birkhauser 
5 Verlag, Basel). Generally, hypermethylated genes are transcriptionally silent and 

inheritance of the proper genomic methylation pattern is critical to viable development 
as shown by DCMTase gene knock-outs in mice (Li, E., et al., 1992, Targeted 
mutation of the DNA methyltransferase gene results in embryonic lethality. Cell 
69:15-926). Anti-sense directed inactivation of DCMTase mRNA as well as the 
10 incorporation of the cytosine analogs 5-azacytidine and 5-fluorocytidine into DNA 
interfere with DCMTase function and lead to cytological dysfunction (Ramachandani, 
S., et al., 1997, Inhibition of tumorigenesis by a cytosine-DNA, methyltransferase, 
antisense oligodeoxynucleotide, Proc. Natl. Acad. Sci. USA 94:684-689; Jones, P.A., 
1985, Altering gene expression with 5-azacytidine, Cell 40:485-486). 

15 

Eukaryotic DCMTase cDNAs have been cloned and sequenced; five are from animal 
sources (mouse: Bestor, T., et al., 1988, Cloning and sequencing of a cDNA encoding 
DNA methyltransferase of mouse cells, J. Mol. Biol. 203:971-983; human: Yen, 
R.C., et al., 1992, Isolation and characterization of the cDNA encoding human DNA 

20 methyltransferase. Nucleic Acids Res. 20:2287-2291 ; chicken: Tajima, S., et al., 
1995, Isolation and expression of a chicken DNA methyltransferase cDNA, J. 
Biochem. 117:1050-1057; frog: Kimura et al., 1996, Isolation and expression of a 
Xenopus laevis DNA methyltransferase cDNA, Joumal of Biochemistry, 120:1 182- 
1 1 89; sea urchin: Aniello et al, 1 996, Isolation of cDNA clones encoding DNA 

25 methyltransferase of sea urchin P. lividus: expression during embryonic development. 
Gene 178:57-61). These DCMTases are composed of a large amino-terminal domain 
and a smaller carboxy-terminal domain that contains many of the major motifs foiuid 
in prokaryotic DCMTases (Posfai, J., et al., 1989, Predictive motifs derived from 
cytosine methyltransferases. Nucleic Acids Res. 17:2421-2435). The amino-terminal 

30 domain has been implicated in nuclear localization to DNA replication foci during S- 
phase (Leonhardt, H., et al, 1992, A targeting sequence directs DNA 
methyltransferase to sites of DNA replication in mammalian nuclei, Cell 71 :865- 
873), metal binding by zinc finger domains, and DNA binding (Bestor, T.H., 1992, 
Activation of the mammalian DNA methyltransferase by cleavage of aZn binding 

35 regulatory domain, EMBO 1 1 :261 1-261 7; Chuang, L.S., et al., 1 996, Characterisation 
of independent DNA and multiple Zn-binding domains at the N terminus of human 
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DNA-(cytosine-5) methyltransferase: modulating the property of a DNA-binding 
domain by contiguous Zn-binding motifs, Chia, J., and Li, B.F.L., J. Mol. Biol. 
257:935-948). 

5 Although the cellular processes that determine the genomic patterns of DNA 

methylation are not understood, DCMTase has an essential role in these processes. A 
basic understanding of the binding and catalytic DNA sequence specificity 
(discrimination) of the enzyme, and the factors which regulate this specificity are 
important. Since the mammalian en2yme is a relatively large, 1 83 kDa protein, DNA 

1 0 sequences flanking the cognate CpG may modulate the ability of the en2yme to 

methylate particular CpG sites. However, the CpG flanking sequence preferences of 
the enzyme, and its preference for single- and double-stranded substrates have not 
been rigorously addressed by previous investigators (Bestor, T.H. et al., 1992, CpG 
islands in mammalian gene promoters are inherently resistant to de novo methylation, 

15 GATA 9:48-53; Hepburn, P.A., et al., 1991, Enzymatic methylation of cytosine in 
DNA is prevented by adjacent O^-methylguanine residues, J. Biol. Chem. 266:7985- 
7987; Bolden, A.H., et al, 1986, Primary DNA sequence determines sites of 
maintenance and de novo methylation by mammalian DNA methyltransferases, Mol. 
Cell. Bio. 6:1135-1 140; Pfeifer, G.P., et al., 1985, Mouse DNA-cytosine-5- 

20 methyltransferase: sequence specificity of the methylation reaction and electron 
microscopy of enzyme-DNA complexes, EMBO J. 4:2879-2884; Ward, C, et al., 
1987, In vitro methylation of the 5 '-flanking regions of the mouse b-globin gene, J. 
Biol. Chem. 262:1 1057-1 1063; Carotti, D., et al., 1986, Substrate preferences of the 
human placental DNA methyltransferase investigated with synthetic 

25 polydeoxynucleotides, Biochim. et Biophys. Acta. 866:135-143; Carotti D. et al, 
1986, supra; Wang, R.Y.H., et al., 1984, Human placental DNA methyltransferase: 
DNA substrate and DNA binding specificity, Nucl. Acids Res. 12:3473-3490; Pfeifer 
et al., 1985, supra; Gruenbaum, Y., et al., 1982, Substrate and sequence specificity of 
a eukaryotic DNA methylase, Nature 295:620-622). 

30 

There is evidence that errors in the proper maintenance of genomic methylation are 
involved in aging and cancer. CpG islands are reported to become hypermethylated 
with age and may down-regulate expression of essential genes (Antequerra & Bird, 
1993, Number of CpG islands and genes in human and mouse, Proceedings of the 
35 National Academy of Sciences, USA, 90:1 1995-1 1999; Nyce, J.W., 1997, Drug- 
induced DNA hypermethylation: A potential mediator of acquired drug resistance 
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during cancer chemotherapy, Mutation Research 386:153-161) Amplification of 
DCMTase expression by an exogenous mammalian DCMTase gene induces 
tumorigenic transformation of NIH 3T3 mouse fibroblasts (Wu et al., 1993, 
Expression of an exogenous eukaryotic DNA methy transferase gene induces 
5 ti-ansformation of NIH 3T3 cells, Proc. Natl. Acad. Sci., USA, 90:8891-8895). 

Human neoplastic cells and cells derived from different stages of colon cancer express 
up to 200-fold higher levels of DCMTase than normal (El-Deiry et al., 1991, High 
expression of the DNA methyltransferase gene characterizes human neoplastic cells 
and progression stages of colon cancer, Proc. Natl. Acad. Sci., USA, 88:3470-3474). 

10 This contributes substantially to timior development in a mouse model of intestinal 
neoplasia (Laird, P.W., et al., 1995, Suppression of intestinal neoplasia by DNA 
hypomethylation, Cell 81:197-205). Changes in DNA methylation and DCMTase 
activity appear early in oncogenesis (Belinsky, S.A., et al., 1996, Increased cytosine 
DNA-methyltransferase activity is target-cell-specific and an early event in Ivmg 

15 cancer, Proc. Natl. Acad. Sci. USA 93:4045-4050). 

Conversely, antisense oligonucleotides that interfere with expression of DCMTase 
may inhibit tumorigenesis (Ramachandani et al, 1997, Inhibition of tumorigenesis by 
a cytosine-DNA methyltransferase, antisense oligonucleotide, Proc. Natl. Acad. Sci., 

20 USA, 94:684-689; MacLeod & Szyf, 1 995, Expression of antisense to DNA 

methyltransferase mRNA induces DNA demethylation and inhibits tumorigenesis, J. 
Biol. Chem. 270:8037-8043). The anticancer agent 5-aza-deoxycytidine functions by 
inhibiting tiie DCMTase (Jones, 1985, Altering gene expression witii 5-azacytidine, 
Cell 40:485-486; Juttemian et al., 1994, Toxicity of 5-aza-2'-deoxycytidine to 

25 mammalian cells is mediated primarily by covalent trapping of DNA 

methyltransferase rather than DNA demethylation, Proc. Nati. Acad. Sci., USA, 
91:11 797-1 1 801). Changes in DNA methylation and DCMTase activity early in 
oncogenesis (Belinsky, S.A., et al., 1996, supra) and the ability of DCMTase 
inhibitors to virtually abolish adenoma formation in mice (Laird, P.W., et al., 1995, 

30 supra) suggest that DCMTase inhibitors might be useful anticancer therapeutics (Szyf, 
M., 1996, The DNA methylation machinery as a target for anticancer therapy, 
Pharmacol. Ther 70:1-37). 5-Aza-deoxycytidine is an irreversible, mechanism- 
based DCMTase inhibitor that has been used in patients with acute myeloid leukemia. 
Unfortunately, 5-Aza-deoxycytidine is unstable in solution and may be carcinogenic 

35 as well as mutagenic (Jones, P.A., 1996, DNA methylation errors and cancer. Cancer 
Res. 56:2463-2467). There is a need for DCMTase inhibitors that do not require 
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incorporation into DNA and that are mechanistically unlike 5-aza-deoxycytidine 
(Belinsky, S.A., et al., 1996, supra; Szyf, M., 1996, supra; Jones, 1996, supra). A 
keen understanding of how DCMTase functions in vitro can be the basis for better 
strategies to both activate and inhibit the enzyme to correct developmental disorders 
5 like cancer. 

Enzymes that catalyze one carbon additions to of pyrimidines define a class of 
enzymes with similar chemistry (Ivanetich, K.M., & Santi, D.V., 1992, 5,6- 
Dihydropyrimidine adducts in the reactions and interactions of pyrimidines with 

10 proteins, Prog. Nucleic Acid Res. Mol. Biol. 42:127-156). The bacterial DNA 

cytosine methyltransferase, M.Hha\ (38 kDa Mr), modifies the internal cytosine in 
GCGC and has an ordered Bi Bi kinetic mechanism in which DNA binds first (Wu, 
J.C., & Santi, D.V., 1987, Kinetic and catalytic mechanism of Hhal methyltransferase, 
J. Biol. Chem. 262:4778-4786). Catalysis involves nucleophilic attack of an active 

15 site cysteine at the position of the cytosine which, in the absence of the cofactor, 
leads to exchange of the hydrogen. A M.//7zaI— DNA cocrystal structure suggests 
that a catalytic intermediate exists that involves the translocation of the target cytosine 
to an extrahelical position covalently bound to an active site cysteine (Klimasauskas, 
S., et al., 1994. Hhal methyltransferase flips its target base out of the DNA helix. Cell 

20 76:357—369). Methyl transfer from AdoMet is followed by {3— elimination to 

regenerate the active enzyme (Wu & Santi, 1987, supra; Osterman, D.G., et al., 1988, 
5-Fluorocytosine in DNA is a mechanism-based inhibitor of Hhal methylase. 
Biochemistry 27:5204-5210). 

25 A recent kinetic study of a highly homogeneous, unproteolyzed preparation of 
DCMTase fi:om mouse erj^hroleukemia cells (MEL) further characterized the 
interactions of the enzyme with DNA and AdoMet (Flynn, J., et al., 1996, Murine 
DNA cytosine-C5 methyltransferase: Pre-steady- and steady-state kinetic analyses 
with regulatory DNA sequences. Biochemistry 35:7308-7315). The invention 

30 disclosed herein descriptively accoimts for the previously reported complexities in 
kinetic behavior and identifies a potent single-stranded oligonucleotide inhibitor that 
binds to the enzyme at a distinct regulatory site. 

There is a need for molecules which modulate the methylation of DNA for the reasons 
35 discussed above. In addition, molecules which inhibit DNA methylation can be useful 
for preventing drug resistance acquired by subjects undergoing cancer chemotherapy. 
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Drug-induced DNA hypermethylation is regarded as a potential mediator of this 
acquired drug resistance (Nyce, J.W., 1997, Drag-induced DNA hypermethylation: A 
potential mediator of acquired drug resistance during cancer chemotherapy. Mutation 
Research 386:153-161). 

5 

SUMMARY OF THE INVENTION 

The invention provides synthetic oligonucleotides comprising a C-5 methylcytosine. 
The oligonucleotide recognizes and binds an allosteric site on DNA methyltransferase 
thereby inhibiting DNA methyltransferase activity. In one embodiment, the synthetic 
1 0 oligonucleotide has an inhibition constant of not greater than 1 000 nM. In another 

embodiment, the synthetic oligonucleotide has an inhibition constant of not greater than 
200 nM. In yet another embodiment, the synthetic oligonucleotide has an inhibition 
constant of not greater than 20 nM. 

1 5 The invention fiirther provides a composition comprising a synthetic oligonucleotide 
comprising a C-5 methylcytosine and which recognizes and binds an allosteric site on 
DNA methyltransferase. The composition is useful for inhibiting DNA 
methyltransferase activity, thereby inhibiting the methylation of DNA. In one 
embodiment, the composition is a pharmaceutical composition comprising a 

20 pharmaceutically effective amount of a synthetic oligonucleotide comprising a C-5 
methylcytosine and which recognizes and binds an allosteric site on DNA 
methyltransferase, and optionally, a pharmaceutically acceptable carrier. The 
pharmaceutical composition is useful for treating disorders associated with methylation 
defects, such as cancer and certain developmental disorders. 

25 

The invention further provides a method of mhibiting methylation of DNA. The method 
involves contacting a DCMTase with a synthetic oligonucleotide which recognizes and 
binds an allosteric site on DNA methyltransferase thereby resulting in a DNA 
methyltransferase/synthetic oligonucleotide complex. The complex is contacted with the 
30 DNA. The presence of the complex prevents binding of AdoMet to DNA 

methyltransferase in a catalytically competent manner thereby inhibiting DNA 
methyltransferase activity and inhibiting methylation of DNA. In one embodiment, the 
synthetic oligonucleotide comprises a C-5 methylcytosine. 

3 5 The invention further provides a method of treating a disorder of cell proliferation or 
development. The method involves administering to a subject a synthetic 
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oligonucleotide which recognizes and binds an aliosteric site on DNA methyltransferase. 
The binding of the synthetic oligonucleotide prevents binding of AdoMet to DNA 
methyltransferase in a catalytically competent manner thereby inhibiting DNA 
methyltransferase. The inhibition of DNA methyltransferase prevents the methylation of 
5 DNA thereby treating the disorder of cell prohferation or development. In one 

embodiment, the synthetic oligonucleotide comprises a C-5 methylcytosine. In one 
embodiment, the disorder of cell prohferation is cancer such as lung cancer, breast 
cancer, prostate cancer, pancreatic cancer or colon cancer. 

1 0 The invention also provides a method of identifying a modulator of DCMTase which 
recognizes and binds an aliosteric site on DCMTase. The method comprises contacting 
a molecule with DCMTase in the presence of AdoMet and DNA. The method further 
comprises measuring DCMTase activity. An increase or decrease in DCMTase activity 
is indicative of a modulator of DCMTase activity. In one embodiment, the modulator is 

1 5 an inhibitor. In another embodiment, the modulator is an activator. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 A shows six synthetic oligonucleotides that mimic the GC-box and the cyclic 
AMP responsive elements (CRE) (SEQ ID NOS:9-12). The appropriate consensus is 
20 in bold type and the single, centrally located CpG dinucleotide is underlined (mC = 
C-5 methylcytosine). The complementary a, aMET, b, and bMET strands were 
annealed to produce umnethylated, a/b, and hemi— methylated, aMET/b or a/bMET, 
double-stranded substrates. 

25 Figure IB shows oligonucleotide sequences corresponding to SEQ ID NOS: 10, 1 1 , 
13, 14 and 1 5, as indicated, which were tested for inhibition in an in vitro assay. GC- 
box pMET has a phosphorothioate backbone, while the others have a deoxyribose 
backbone. Kjj is the inhibition constant derived from the y— intercept values from 
double reciprocal plots and is a characteristic of the inhibitor binding to the aliosteric 

30 site of the enzyme. IC50 is the concentration of inhibitor that produces 50% activity of 
an xminhibited reaction. 

Figure 2 is an autoradiogram showing the results of gel mobility shift analysis varying 
DCMTase with constant GC-box a/b. Lane 1 : 0 nM DCMTase; Lane 2: 5.0 nM 
35 DCMTase; Lane 3 : 1 0 nM DCMTase; Lane 4: 20 nM DCMTase; Lane 5 : 30 nM 
DCMTase; Lane 6: 35 nM DCMTase; Lane 7: 40 nM DCMTase; Lane 8: 45 nM 
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DCMTase; Lane 9: 50 nM DCMTase; Lane 10: 65 nM DCMTase; Lane 1 1 : 75 nM 
DCMTase; Lane 12: 95 nM DCMTase. 

Figure 3 is an autoradiogram showing the results of a gel mobility shift analysis 
5 varying GC-box a/b with constant DCMTase. Lane 1 : 0.050 |j,M Free DNA; Lane 
2: 0.10 \iM Free DNA; Lane 3: 0.15 ^iM Free DNA; Lane 4: 0.10 ^iM DNA; Lane 5: 
0.28 ^iM DNA; Lane 6: 0.45 jiM DNA; Lane 7: 0.63 \iM DNA; Lane 8: 0.80 ^M 
DNA ; Lane 9: 1.0 fiM DNA ; Lane 10: 2.0 fiM DNA; Lane 1 1 : 2.0 Free DNA. 
Lanes 1,2,3, and 11 are control experiments without added DCMTase. 

10 

Figure 4 is an autoradiogram showing the results of a gel mobility shift analysis 
varying GC-box a/b with constant DCMTase. Lane 1: 0.050 (xM Free DNA; Lane 2: 
0.10 Free DNA; Lane 3: 0.15 |iM Free DNA; Lane 4: 0.10 ^iM DNA; Lane 5: 
0.50 |xM DNA; Lane 6: 1.0 |aM DNA; Lane 7: 4.0 ^M DNA ; Lane 8: 6.0 nM DNA; 
15 Lane 9: 6.0 |j,M Free DNA. Lanes 1, 2, 3, and 9 are control experiments without added 
DCMTase. 

Figure 5 is an autoradiogram showing the results of a gel mobility shift analysis 
varying GC-box b with constant DCMTase. Lane 1:0.10 ^M Free DNA; Lane 2: 
20 0.20 \iM DNA; Lane 3: 0.40 nM DNA; Lane 4: 0.80 ^iM DNA; Lane 5: 1 .6 ^M DNA; 
Lane 6: 3.2 ixM DNA; Lane 7: 6.4 ^iM DNA; Lane 8: 3.2 ^iM DNA; Lane 9: 6.0 yM 
CRE a fh. Lanes 1 and 8 are control experiments vvithout added DCMTase. 

Figure 6 shows a randomized DNA substrate used in in vitro screening (SEQ ID 
25 NOS: 16— 1 7). The top strand shown was synthesized using b— cyanoethyl 

phosphoramidite chemistry. The PCR primers used for amplifying the shifted DNA 
are underlined. Primer C is underlined and contains an EcoRI restriction site. Primer 
D, underlined twice, contains a BamHI restriction site and was annealed to the 
randomized top strand for extension by Klenow polymerase The randomized positions 
30 are denoted as N and are either dG, dA or dT on one strand and the complementary 
dC, dA or dT on the other strand of the duplex. 

Figure 7 shows cloned and sequenced individual isolates fi-om the pooled generations 
(SEQ ID NOS: 18-1 00 respectively). Only the guanine containing strand is shown for 
35 simplicity. Generation-5 members are arranged with the highest guanine content on 
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the 5' side of the invariant CpG at the top. Frequency information is given for each 
randomized flank on the appropriate border, an asterisk denotes a single occurrence. 

Figure 8A shows the nucleotide frequency at each randomized flanking position for 
5 the generation— 5 screening in the form of a bar graph indicating the percent 

occurrence of each nucleotide at the randomized positions. The predominance of 
guanosine extends over the entire randomized region. The horizontal line at 33% is 
representative of the starting pool frequencies. The line at 70% is added as a visual 
aid. 

10 

Figure 8B lists the nucleotide percentages at each randomized position for the 
generation-5 screening. 

Figure 9 show^s genomic sequences similar to the DCMTase selected generation-5 
15 clones (SEQ ID NOS:101-110). Fasta searches through the mouse and human 
GenBank libraries produced these matches v^^hen limited to no greater than four 
mismatches and no gaps. The definitions have been edited from the original enfries. 

Figure 10 shows initial velocity curves of the selected generations. Squares, 
20 generation-1 pool; triangles, generation-2 pool; circles, generation-4 pool; 
diamonds, generation-5 pool. 

Figure 1 1 shows substrate inhibition plots. Reactions contained 3.0 nM DCMTase and 
10 laM AdoMet in MR buffer. The inset shows data in which GC— box a/b was the 
25 substrate, using 100 nM DCMTase. Experimental data are shown scattered around a 
line fit to equation 1 for substrate inhibition. For a direct comparison of the DNA 
substrates, data are expressed as a Vmax normalized, S/K„P^^ ratio. 

Figure 12A shows double reciprocal plots of velocity versus substrate concentration. 
30 Poly(dI-dC:dI-dC) was varied and lines represent a constant AdoMet concentration: 
triangles, 4 \iM; squares, 2 fxM; diamonds, l|xM; circles, 0.5 |j.M. Experimental data 
are shovra scattered around lines derived from the fit of equation 2 for a sequential 
mechanism. 

35 Figure 12B shows double reciprocal plots of velocity versus substrate concentration. 
AdoMet was varied and lines represent a constant poly(dI-dC:dI-dC) concentration: 
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triangles, 112 pM; squares, 56 pM; diamonds, 28 pM; circles, 14 pM. Experimental 
data are shown scattered around lines derived from the fit of equation 2 for a 
sequential mechanism. 

5 Figure 1 3 shows a double reciprocal plot of velocity versus poly(dI-dC:dI-dC) with 
varying GC-box b concentrations. The GC-box b concentrations were: diamonds, 0; 
circles, 0.75 |4,M; triangles, 1.5 ]iM; squeires, 5.0 |xM. Experimental data are shown 
scattered around lines derived from the fit to equation 5 for noncompetitive inhibition. 

10 Figure 14 is a double reciprocal plot of velocity vs. poly(dI-dC:dI-dC) with varying 
GC— box b^ET concentrations. The GC-box b^^T concentrations were: squares, 0; 
circles, 1 0 nM; diamonds, 20 nM; triangles, 40 nM. Experimental data are shown 
scattered around lines derived from a fit to the log form of equation 6 for 
uncompetitive inhibition. 

15 

Figiire 1 5 shows a double reciprocal plot of velocity versus AdoMet with varying 
GC-box b concentrations. The GC-box b concentrations were: squares, 0; 
circles, 20 nM; diamonds, 40 nM; triangles, 80 nM. Experimental data are shown 
scattered around lines derived from a fit to equation 4 for competitive inhibition. 

20 

Figure 16 shows a double reciprocal plot of AdoHcy product inhibition with varying 
AdoMet concentrations. The AdoHcy concentrations were: squares, 0; diamonds, 0.75 
^M; circles, 1.5 |j.M; triangles, 3.0 \iM; notched squares, 6.0 ^iM. Experimental data 
are shown scattered around lines derived from a fit to equation 4 for competitive 
25 inhibition. 

Figure 17A shows a double reciprocal plot of AdoHcy product inhibition with varying 
poIy(dI-dC:dI-dC) concentrations, in which AdoMet was held constant at 1.2 p,M. The 
AdoHcy concentrations were: squares, 0; diamonds, 15 (xM; circles, 30 \iM. 

30 

Figure 17B shows a double reciprocal plot of AdoHcy product inhibition with varying 
poly(dI-dC:dI-dC) concenfrations, in which AdoMet was held constant at 8 ^M. The 
AdoHcy concentrations were: squares, 0; diamonds, 15 ^M; circles, 30 nM. 

35 Figure 17C shows a double reciprocal plot of AdoHcy product inhibition with varying 
poly(dI-dC:dI-dC) concenfrations. The AdoHcy concenfrations were: squares, 0; 
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diamonds, 15 |j,M; circles, 30 |xM. These are secondary slope replots from a series of 
experiments in which the AdoMet concentrations were: circles, 6.3 ^M; diamonds, 
2.5 nM; squares 1 |iM. 

5 Figure 1 8 shows a double reciprocal plot of poly(dId™C:dId'"C) product inhibition 
with varying AdoMet concentrations. The poly(dId'"C:dId"'C) concentrations were: 
squares, 0; diamonds, 5.0 pM; circles, 10 pM; triangles, 20 pM. Experimental data are 
shown scattered around lines derived from a fit to equation 5 for noncompetitive 
inhibition. 

10 

Figure 19A shows a double reciprocal plot of poly(dId'"C:dId'"C) product inhibition 
with varying poly(dI-dC:dI-dC) concentrations. The poly(dId'"C:dId"'C) concentrations 
were: squares, 0; triangles, 34 pM; circles, 45 pM; diamonds, 68, notched squares, 90 
pM. Experimental data are shovra scattered around lines derived from a fit to equation 
15 4 for competitive inhibition. The fitting is not acceptable. 

Figure 19B shows a double reciprocal plot of poly(dId"'C:dld'°C) product inhibition 
with varying poly(dI-dC:dI-dC) concentrations. The poly(dId"'C:dId'"C) concentrations 
were: squares, 0; triangles, 34 pM; circles, 45 pM; diamonds, 68, notched squares, 90 
20 pM. Experimental data are shown scattered around lines derived from a fit to equation 
5 for noncompetitive inhibition. The fitting is not acceptable. 

Figure 20 shows initial velocity plots of different poly(dIdC:dIdC) lengths. The 
poly(dI-dC:dI-dC) sizes were: circles, 100 base-pairs; diamonds, 500 base-pairs; 
25 triangles, 2000 base-pairs; squares, 5000 base-pairs. The inset provides a zoom in 
along the z-axis toward the origin to show the quality of the data. 

Figure 21 shows a plot of DCMTase specificity as a fiinction of poly(dIdC:dIdC) 
length. The apparent constants were derived from Figure 20 and are shovm in Table 6. 
30 The data was fit well by an isotherm that yielded a half-maximal length of 1 200 
base-pairs and a maximal specificity value of 29 x 10^* hr~^pM~' with 
poly(dIdC:dIdC) as the substrate. 

Figure 22 shows a proposed kinetic mechanism. DCMTase appears to progress 
35 through the catalytic cycle by the Ordered Bi-Bi mechanism shown. 
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Figure 23A is a double-reciprocal plot of poly(dId'"C:dId'"C) product inhibition with 
varying poly(dI-dC:dI-dC) concentrations. Reactions contained 20 nM DCMTase and 
1.5 iiU AdoMet in 100 mM Tris pH 8.0, 10 mM EDTA, 10 mM DTT, 200 ^ig/mL 
BSA. Incubations were at 37 °C for 60 minutes. The poly(dI-dC:dI-dC) concentrations 
5 were 20, 40, 80, 1 20 and 1 60 pM. The poly(dId™C:dId"'C) concentrations were: 
squares, 0; triangles, 34 pM; circles, 45 pM; diamonds, 68, notched squares, 90 pM. 
Shown are the intersecting noncompetitive lines. 

Figure 23B is the slope replot of the plot shown in Figure 23 A. 

Figure 23C is the>^intercept replot obtained from the lines in Figure 23 A. 

10 DETAILED DESCRIPTION OF THE INVENTION 

The invention provides a synthetic oligonucleotide comprising a C-5 methylcytosine 
and which recognizes and binds an allosteric site on DNA cytosine methyltransferase 
thereby modulating DCMTase activity associated with the allosteric site. In one 
embodiment, the modulating comprises inhibition. In another embodiment, the 
15 modulating comprises activation. The C-5 methylcytosine of the synthetic 
oligonucleotide can be present as a 5mCpG dinucleotide. 

In one embodiment, the DCMTase is from a mammal, bird, fish, amphibian, reptile, 
insect, plant, bacterium, virus or fimgus. The mammal can be selected from the group 
20 consisting of mouse and human. 

In one embodiment, the synthetic oligonucleotide comprises a nucleotide sequence as 
shown in Figure IB and designated GC-box b^^"^ (SEQ ID NO: 1 0), GC-box p'^ 
(SEQ ID NO: 1 0), GC-box c^ (SEQ ID NO: 1 3), GC-box d'^ ^ (SEQ ID NO: 14), 

25 GC-box e^ (SEQ ID NO: 1 5), or CRE a"^"" (SEQ ID NO: 11). hi one embodiment,the 
synthetic oligonucleotide has an inhibition constant of not greater than 1000 nM by 
steady-state kinetic assay. In another embodiment, he synthetic oligonucleotide has an 
inhibition constant of not greater than 200 nM by steady-state kinetic assay. In yet 
another embodiment, the synthetic oligonucleotide has an inhibition constant of not 

30 greater than 20 nM by steady-state kinetic assay. 

In accordance with the practice of the invention, the oligonucleotide can be DNA, RNA, 
or a derivative or hybrid thereof. The invention further provides a composition 
comprising a synthetic oligonucleotide comprising a C-5 methylcytosine and which 
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recognizes and binds an allosteric site on DNA methyltransf erase. The composition is 
usetul for inhibiting DNA methyltransferase activity, thereby inhibiting the methylation 
of DNA. In one embodiment, the composition is a pharmaceutical composition 
comprising a pharmaceutically effective amount of a synthetic oligonucleotide 
5 comprising a C-5 methylcytosine, or a pharmaceutically acceptable salt thereof, and 
which recognizes and binds an allosteric site on DNA methyltransferase. In one 
embodiment, the pharmaceutical compositon further comprises a pharmaceutically 
acceptable carrier. The pharmaceutical composition is usefiil for treating disorders 
associated with methylation defects, such as cancer and certain developmental disorders. 

10 

The invention ftirther provides a method of inhibiting methylation of DNA. The method 
involves contacting a DNA methyltransferase with a synthetic oUgonucleotide which 
recognizes and binds an allosteric site on DNA methyltransferase thereby resulting in an 
enzyme/synthetic oligonucleotide complex. The presence of the complex prevents 

15 binding of AdoMet to DNA methyltransferase in a catalytically competent manner 

thereby inhibiting DNA methyltransferase activity and inhibiting methylation of DNA. 
In one embodiment, the enzyme/s3Tithetic olignucleotide complex forms a further 
complex with DNA. In one embodiment, the synthetic oligonucleotide comprises a C-5 
methylcytosine. In one embodiment, the C-5 methylcytosine is present as a 5mCpG 

20 dinucleotide. 

The invention further provides a method of treating a disorder of cell proliferation or 
development. The method involves administering to a subject a synthetic inhibitor 
molecule which recognizes and binds an allosteric site on DNA methyltransfer^e. The 

25 binding of the synthetic inhibitor molecule prevents binding of AdoMet to DNA 
methyltransferase in a catalytically competent manner thereby inhibiting DNA 
methyltransferase. The inhibition of DNA methyltransferase prevents the methylation of 
DNA thereby treating the disorder of cell proliferation or development. In one 
embodiment, the synthetic oligonucleotide comprises a C-5 methylcytosine which 

30 recognizes and binds an allosteric site on DCMTase thereby inhibiting DNA 

methyltransferase activity. In one embodiment, the disorder of cell proliferation is 
cancer, such as lung cancer or colon cancer. In one embodiment, the disorder of 
development is one linked to a genetic locus regulated by methylation. Examples of 
such disorders include, but are not limited to, Huntington's disease, Down's syndrome, 

35 and disorders associated with a Hox gene. 
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The invention provides a method of inhibiting proliferation of cancer cells comprising 
administering to a subject a synthetic inhibitor molecule which recognizes and binds an 
allosteric site on DCMTase thereby resulting in an enzyme/synthetic inhibitor molecule 
complex, inhibiting DCMTase-mediated methylation of DNA, and thereby inhibiting 

5 proliferation of the cancer cells. In one embodiment, the cancer cell is from lung or 
colon, hi one embodiment, the synthetic inhibitor molecule is an oligonucleotide 
comprising a C-5 methylcytosine which recognizes and binds an allosteric site on 
DCMTase thereby inhibiting DNA methyltransferase activity. In one embodiment, the 
C-5 methylcytosine is present as a 5mCpG dinucleotide. In one embodiment, the 

10 synthetic oligonucleotide comprises a nucleotide sequence as shovra in Figure IB and 
designated GC-box b"^^ (SEQ ID NO: 10), GC-box p^ ^ (SEQ ID NO: 1 0), GC-box 
c'^^^(SEQ ID N0:13), GC-box d'^'^(SEQ ID N0:14), GC-box e^^'^(SEQ ID 
NO: 1 5), or CRE a'^'^ (SEQ ID NO:l 1). hi one embodiment, the subject is a human. In 
another embodiment, the subject is an animal. In one embodiment, the animal is selected 

15 from a group consisting of porcine, piscine, avian, feline, equine, bovine, ovine, caprine 
and canine. 

Definitions 

All scientific and technical terms used in this application have meanings commonly used 
20 in the art unless otherwise specified. As used in this application, the following words or 
phrases have the meanings specified. 

As used herein "synthetic oligonucleotide comprising a C-5 methylcytosine" means any 
non-naturally occurring oligonucleotide comprising a C-5 methylcytosine. The 

25 oligonucleotide can be a RNA, DNA or a derivative or hybrid thereof. The C-5 

methylcytosine can be in the form of a 5mCpG dinucleotide. In one embodiment, the 
C-5 methylcytosine is centrally located within the oligonucleotide. In one embodiment, 
the synthetic oligonucleotide of the invention can be approximately 5 to approximately 
70 bases in length. In another embodiment, the synthetic oligonucleotide can be 

30 approximately 15 to approximately 50 bases in length. In another embodiment, the 

synthetic oligonucleotide can be approximately 20 to approximately 30 bases in length. 
In another embodiment, the synthetic oUgonucleotide is approximately 30 bases in 
length. Examples of synthetic oligonucleotides of the invention include, but are not 
hmited to, the oligonucleotides GC-box b^^ (SEQ ID NO: 10), GC-box p'^'^ (SEQ ID 

35 NO:l 0), GC-box c^ ^ (SEQ ID NO: 13), GC-box d^^"^ (SEQ ID NO: 14), GC-box e^^ 
(SEQ ID NO: 1 5), and CRE a^'^'^ (SEQ ID NO: 11 ) shown in Figure 1 B. 
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As used herein, "synthetic inhibitor molecule" includes synthetic molecules known in 
the art to facilitate entry of nucleic acids into cells and to minimize intracellular and 
intercellular breakdown of the nucleic acids. Examples of such antisense molecules 
5 include, but are not limited to, peptide nucleic acid (PNA) and phosphorothioate— based 
molecules such as deoxyribonucleic guanidine (DNG) or ribonucleic guanidine (RNG). 
Also included are nonnucleic acid polymers derived from a library screen which bind 
the same site as the synthetic oligonucleotide of the invention. 

10 As used herein, "an allosteric site" means a site other than an active site that can 
influence the catalytic progress of the enzyme. The influence can either inhibit or 
activate catalysis. For example, an active site on DCMTase includes the site to which 
AdoMet binds, the binding of AdoMet to the active site on DCMTase leading to the 
methylation of DNA. An active site is defmed as the local protein envirormient in close 

15 proximity to the reactive substituents in the methylation reaction. 

As used herein, "DNA methyltransferase activity" means erLzymatic activity that 
promotes transfer of a methyl group to DNA, thereby methylating DNA. An example 
of a source of a methyl group for transfer to DNA is AdoMet. 

20 

As used herein, "pharmaceutically acceptable salt" refers to a salt that retains the 
desired biological activity of the parent compound and does not impart any undesired 
toxicological effects. Examples of such salts include, but are not limited to, (a) acid 
addition salts formed with inorganic acids, for example hydrochloric acid, 

25 hydrobromic acid, sulfuric acid, phosphoric acid, nitric acid and the like; and salts 
formed with organic acids such as, for example, acetic acid, oxalic acid, tartaric acid, 
succinic acid, maleic acid, furmaric acid, gluconic acid, citric acid, malic acid, 
ascorbic acid, benzoic acid, tannic acid, pamoic acid, alginic acid, polyglutamic acid, 
naphthalenesulfonic acids, naphthalenedisulfonic acids, polygalacturonic acid; (b) 

30 salts with polyvalent metal cations such as zinc, calcium, bismuth, barium, 

magnesium, aluminum, copper, cobalt, nickel, cadmium, and the like; or (c) salts 
formed with an organic cation formed from N,N'-dibenzylethylenediamine or 
ethylenediamine; or (d) combinations of (a) and (b) or (c), e.g., a idnc tannate salt; and 
the like. The preferred acid addition salts are the trifluoroacetate salt and the acetate 

35 salt. 
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As used herein, "pharmaceutically acceptable carrier" includes any material which, 
when combined with a compound of the invention, allows the compound to retain 
biological activity and is non-reactive with the subject's immune system. Examples 
include, but are not limited to, any of the standard pharmaceutical carriers such as a 

5 phosphate buffered saline solution, water, emulsions such as oil/water emulsion, and 
various types of wetting agents. Preferred diluents for aerosol or parenteral 
administration are phosphate buffered saline or normal (0.9%) saline. 
Compositions comprising such carriers are formulated by well known conventional 
methods (see, for example. Remington's Pharmaceutical Sciences, Chapter 43, 14th 

10 Ed., Mack Publishing Co., Easton PA 1 8042, USA). 

Compounds of the Invention 

The invention provides a synthetic oUgonucleotide comprising a C-5 methylcytosine 
and which recognizes and binds an allosteric site on DN A methyltransferase thereby 

15 inhibiting DNA methyltransferase activity, hi one embodiment,the synthetic 

oligonucleotide has an inhibition constant of not greater than 1000 nM by steady-state 
kinetic ^say. In another embodiment, the synthetic oligonucleotide has an inhibition 
constant of not greater than 200 nM by steady— state kinetic assay. In yet another 
embodiment, the synthetic oligonucleotide has an inhibition constant of not greater than 

20 20 nM by steady-state kinetic assay. In one embodiment, the C— 5 methylcytosine is 
centrally located within the oligonucleotide. In one embodiment, the synthetic 
oligonucleotide of the invention can be approximately 5 to approximately 70 bases in 
length. In another embodiment, the synthetic oligonucleotide can be approximately 15 to 
approximately 50 bases in length. In another embodiment, the synthetic ohgonucleotide 

25 can be approximately 20 to approximately 30 bases in length. In a further embodiment, 
the synthetic ohgonucleotide is approximately 30 bases in length. Examples of synthetic 
oligonucleotides of the invention include, but are not limited to, the oligonucleotides 
shown in Figure IB and designated GC-box b^'^(SEQ ID NO: 10), GC-box p'^^^ 
(SEQ ID NO:10), GC-box c^'^(SEQ IDN0:13), GC-box d^^(SEQ ID NO: 14), 

30 GC-box e^^(SEQ ID N0:15), or CRE a^^(SEQ ID NO:l 1). 

Compositions Of The Invention 

The invention further provides a composition comprismg a synthetic oligonucleotide 
comprising a C-5 methylcytosine and which recognizes and binds an allosteric site on 
35 DNA methyltransferase. The composition is usefiil for inhibiting DNA 
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methyltransferase activity, thereby inhibiting the methylation of DNA. In one 
embodiment, the composition is a pharmaceutical composition comprising a 
pharmaceuticaily effective amount of a synthetic oligonucleotide comprising a C-5 
methylcytosine, or a pharmaceuticaily acceptable salt thereof, and which recognizes and 
5 binds an allosteric site on DNA methyltransferase. hi one embodiment, the 

pharmaceutical compositon further comprises a pharmaceuticaily acceptable carrier. The 
pharmaceutical composition is useful for treating disorders associated with methylation 
defects, such as cancer and certain developmental disorders. 

10 Administration of the Compositions 

In accordance with the methods of the invention, the synthetic oligonucleotide can be 
administered in a pharmaceutical composition in unit dosage form. The most effective 
mode of administration and dosage regimen for the molecules of the present invention 
depend upon the location of the tissue or disease being treated, the severity and course 
1 5 of the medical disorder, the subject's health and response to treatment and the 

judgment of the treating physician. Accordingly, the dosages of the molecules should 
be titrated to the individual subject. 

By way of example, the interrelationship of dosages for animals of various sizes and 
20 species and for humans based on mgW of surface area is described by Freireich, E.J., 
et al. Cancer Chemother., Rep. 50 (4): 219-244 (1966). It would be clear that the dose 
of the composition of the invention required to achieve an appropriate clinical 
outcome may be further reduced with schedule optimization. 

25 Methods of the Invention 

TTie invention fiirther provides a method of inhibiting methylation of DNA. The method 
involves contacting a DCMTase with a synthetic inhibitor molecule in the presence of 
the DNA. The synthetic inhibitor molecule comprises a C— 5 methylcytosine which 
recognizes and binds an allosteric site on DNA cytosine methyltransferase (DCMTase) 

30 thereby resulting in an enzyme/synthetic inhibitor molecule complex. The presence of 
the complex prevents DCMTase-mediated catalysis thereby inhibiting DCMTase 
activity and inhibiting methylation of DNA. In one embodiment, the synthetic 
oligonucleotide comprises a C— 5 methylcytosine. In a fiirther embodiment, the C— 5 
methylcytosine is present as a 5mCpG dinucleotide. Examples of synthetic inhibitor 

35 molecules include, but are not limited to, the oligonucleotides shown in Figure IB and 
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designated GC-box b'^^ (SEQ ID NO: 10), GC-box p^^ (SEQ ID NO: 10), GC-box 
c'^^^CSEQ ID N0:13), GC-box d'^'^(SEQ ID N0:14), GC-box e'^^'^(SEQ ID 
NO: 1 5), or CRE a*^^ (SEQ ID NO:l 1). 

5 The invention fiirther provides a method of treating a disorder of cell proliferation or 
development. The method involves administering to a subject a synthetic 
oligonucleotide which recognizes and binds an allosteric site on DCMTase. The binding 
of the synthetic oligonucleotide prevents DCMTase— mediated catalysis thereby 
inhibiting DCMTase activity. The inhibition of DCMTase prevents the methylation of 

1 0 DN A thereby treating the disorder of cell proliferation or development. In one 

embodiment, the synthetic oligonucleotide comprises a C-5 methylcytosine. In one 
embodiment, the disorder of cell proliferation is cancer, such as lung cancer or colon 
cancer. In another embodiment, the disorder of development is one linked to a genetic 
locus regulated by methylation. Examples of such disorders include, but are not limited 

1 5 to, Huntington's disease, Down's syndrome, and disorders associated with a Hox gene. 

The invention provides a method of inhibiting proliferation of cancer cells comprising 
administering to a subject a synthetic inhibitor molecule which recognizes and binds an 
allosteric site on DCMTase thereby resulting in an enzyme/synthetic inhibitor molecule 

20 complex. The presence of the complex prevents DCMTase catalysis thereby inhibiting 
DCMTase-mediated methylation of DNA, thereby inhibiting proliferation of the cancer 
cells. In one embodiment, the synthetic inhibitor molecule is an oligonucleotide 
comprising a C-5 methylcytosine which recognizes and binds an allosteric site on 
DCMTase thereby inhibiting DNA methyltransferase activity. In one embodiment, the 

25 cancer is lung cancer or colon cancer. In one embodiment, the method of inhibiting 
proliferation of cancer cells comprises administering to a subject the synthetic 
oligonucleotide of the invention in a sufficient amount so that the ohgonucleotide 
recognizes and binds an allosteric site on DCMTase so as to form an enzyme/synthetic 
oligonucleotide complex. 

30 

The invention provides a method of inhibiting hypermethylation of DNA comprising 
contacting a DNA cytosine methyltransferase (DCMTase) with a synthetic inhibitor 
molecule comprising a C~5 methylcytosine which recognizes and binds an allosteric site 
on DCMTase thereby resulting in an enzyme/synthetic inhibitor molecule complex, in 
35 the presence of the DNA. The presence of the complex prevents DCMTase catalysis 
thereby inhibiting DCMTase activity and inhibiting hypermethylation of the DNA. In 
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one embodiment, the synthetic oligonucleotide comprises a C-5 methylcytosine. In a 
further embodiment, the C-5 methylcytosine is present as a 5mCpG dinucleotide. The 
inhibition of hypermethylation of DNA is useful for preventing the development of 
resistance to drugs such as anti-cancer drugs. 

5 

The invention provides a method of inhibiting dmg resistance in a subject comprising 
administering to a subject the synthetic oligonucleotide of the invention in a sufficient 
amount so that the oligonucleotide recognizes and binds an allosteric site on DCMTase 
SO as to form an enzyme/synthetic oligonucleotide complex. The presence of the 
10 complex prevents DCMTase catalysis so as to inhibit DCMTase-mediated 

hypermethylation of DNA thereby inhibiting drug resistance. The synthetic inhibitor 
molecule can be administered to a subject prior to, concurrent with or after 
administration of an anti-cancer therapeutic agent to prevent overmethyiation of DNA 
induced in the subject's cells in response to the anti-cancer therapeutic agent. 

15 The invention additionally provides a method for screening molecules, such as those 
obtained fi-om a combinatorial library, to identify modulators of DCMTase which 
recognize and bind an allosteric site on DCMTase. In one embodiment, the modulator 
is an inhibitor of DCMTase. In another embodiment, the modulator is an activator of 
DCMTase. The method comprises contacting a molecule with DCMTase in the 

20 presence of AdoMet and DNA, and measuring DCMTase activity. An increase in 
DCMTase activity is indicative of an activator of DCMTase and a decrease in 
DCMTase activity is indicative of an inhibitor of DCMTase. DCMTase activity can 
be measured by methods known in the art, including the assays disclosed in the 
Examples provided herein. Those of ordinary skill in the art can identify a modulation 

25 of DCMTase activity that is indicative of binding an allosteric site on the en2yme. In 
a preferred embodiment, DCMTase activity is measured by a steady-state assay. One 
can plot enzyme activity as a function of varied concentrations of the molecule being 
tested, and also as a fiinction of varied concentrations of DNA and AdoMet. 
Preferably, a mathematical fit is performed on the plotted results. Competitive 

30 inhibition by DNA and uncompetitive inhibition by AdoMet, or competitive inhibition 
by AdoMet and uncompetitive inhibition by DNA, for example, would be indicative 
of an inhibitor molecule which recognizes and binds an allosteric site on DCMTase. 
Also included within the invention are modulators of DCMTase which recognize and 
bind an allosteric site on DCMTase, and which are identified by the above method. 
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Advantages of the Invention 

The invention disclosed herein provides a potent and reversible inhibitor of DNA 
methyltransferase that does not require incorporation into DNA. This inhibitor can be 
used to inhibit methylation of DNA and to treat disorders associated with DNA 
5 methylation defects, such as cancer and developmental disorders. 

In addition to identifying particular synthetic oligonucleotides which inhibit DNA 
methyltransferase, the invention provides information about the mechanism 
responsible for this inhibition. By identifying an ailosteric site on DCMTase as the 
10 site of action of the inhibitors, the invention provides a basis for developing and 

identifying variants of the particular synthetic oligonucleotides disclosed herein that 
will also be useful for inhibiting DNA methyltransferase. Additionally, the disclosure 
herein teaches that a C-5 methylcytosine is responsible for the potency of the 
inhibition effected by the synthetic oligonucleotides of the invention. 

15 

EXAMPLES 

The following examples are presented to illustrate the present invention and to assist 
one of ordinary skill in making and using the same. The examples are not intended in 
any way to otherwise limit the scope of the invention. 

20 

Example 1: DNA Binding Discrimination of the Murine DNA Cytosine-C" 
Methvltransferase 

In this example gel mobility shift analyses (GMS A) using defined sequences to 
estimate Kd^^'^ and in vitro screening method of a large, divergent pool of DNA, are 
25 used to determine discrimination of DCMTase. The results presented herein 

demonstrate that the DCMTase '.DNA complex is concluded to be thermodynamically 
stabilized by guanosine/cytosine-rich sequences flanking a central CpG cognate site. 

Materials 

30 DCMTase was purified from mouse erythroleukemia cells as previously described 
(Xu, G., et al. (1995) Biochemi. Biophysi. Res. Communi. 207:544-551). S- 
adenosyl-L-[methyl-3H]methionine (75 Ci/mmol, 1 mCi/ml, 1 Ci=37 GBq) was 
from Amersham Life Sciences (Arlington Heights, Illinois), Unlabeled AdoMet, 
purchased firom Sigma Chemical Company (St. Louis, MO), was further purified as 
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described (Reich, N.O. & Mashhoon, N. (1990) Inhibition of EcoRl DNA methylase 
with cofactor analogs. J. Biol. Chem. 265:8966-8970). Routinely, a 125 mM AdoMet 
stock concentration was prepared at a specific activity of 5.8 x lO^ cpm/pmol. DE81 
filters were purchased from Whatman Inc. (Lexington, MA). All other chemicals and 
5 reagents were purchased from Sigma Chemical Company (St. Louis, MO) or Fisher 
Scientific (Hampton, New Hampshire). 

DNA Substrate Preparation 

The preparation, purification, and analysis of six oligonucleotides that mimic the GC- 
10 box and the cyclic AMP responsive elements (CRE) were previously described 

(Flynn, J., et al., 1996, Murine DNA cytosine-C5 methyltransferase: Pre-steady and 
steady-state kinetic analyses with regulatory DNA sequences, Biochemistry 35:7308- 
7315) (Figiu-e 1). The percentage of double-stranded DNA in aimealed DNA samples 
was confirmed to be greater than 99% by ^^p-radiolabeling, poiyacrylamide gel 
1 5 separation, subsequent autoradiography and densitometry using a CCD camera and the 
SW5000 analysis package from Ultra Violet Products (UVP, San Gabriel, CA). 

Gel Mobility Shift Assays 

Gel mobility shift assays (GMSA) were performed with minor revisions to the original 

20 procedures (Fried, M., & Crothers, D.M., 1981, Equilibria and kinetics of the lac 

repressor-operator interactions by poiyacrylamide gel electrophoresis. Nucleic Acids 
Res. 9:6505-6525; Gamer, M.M. & Revzin, A., 1981, A gel electrophoresis method 
for quantifying the binding proteins to specific DNA regions: applications to 
components of the Escherichia coli lactose operon regulatory system. Nucleic Acids 

25 Res. 1 3 :3047-3060). All reactions were done in 1 00 mM Hepes pH 7.4, 1 0 mM 
EDTA, 10 mM DTT, 200 mg/ml BSA, 5% glycerol using the indicated ^^P-labeied 
DNA and DCMTase concentrations, incubated on ice for 5 minutes and loaded on a 
IxTBE (89 mM Tris-HCl pH 8.3, 89 mM boric acid, 2 mM EDTA), 6% 
poiyacrylamide gel. Electrophoresis was done at 250 V, 9 mA for 2 hours at 4 ^'C and 

30 the dried gel was exposed to film overnight. The reaction conditions for buffer, 
temperature, incubation time, cofactor addition and gel composition have all been 
optimized. Only slightly better complex resolution was obtained xmder the listed 
conditions compared to a 10 minute incubation at 37 °C prior to gel loading at room 
temperature and containing either cofactor S-adenosyl-L-methionine, product -S- 

35 adenosyl-L-homocysteine, or the AdoMet analog sinefungin. Hepes reaction buffer at 
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pH 7.4 produced sharper banding than Tris-HCl at pH 8.0. Initial binding assays, with 
a limiting, and constant DNA concentration, resulted in the formation of multiple 
bands. Subsequent assays used a limiting and constant enzyme concentration with 
varying DNA concentrations. 

5 

Binding Isotherm Determinations of Ko^^'^ 

Autoradiogram-derived band intensities corresponding to the mobility shifted 
DCMTase:DNA complexes were acquired using the UVP system described above. 
Background subtractions were from equivalent areas about one centimeter below each 

10 mobility shifted complex. The corrected intensities were then fit to a nonlinear 

binding isotherm and graphed using KaleidaGraph 2.1.2 software (Synergy Software, 
Reading, PA) The intensity of the labeled DNA in the protein: DNA complex at 
saturation was directly compared to uncomplexed DNA areas in control lanes 
containing 50%, 100% and 150% molar DNA equivalents of the DCMTase 

15 concentration. 

Screening for DNA binding preferences 

An in vitro selection approach was used to determine the DNA binding discrimination 
of DCMTase. A population of DNA molecules, each 66 base pairs long, were 

20 synthesized with a central CpG dinucleotide flanked on each side by 12 positions 

randomized with either adenosine, thymidine or cytidine; total complexity equal to 2.8 
x lO'' discrete sequences (Figure 6). Guanosine was not added to the randomization to 
avoid multiple CpG dinucleotides on a double-stranded DNA. The randomized 
regions are flanked by PCR primer regions that contain the restriction sites used for 

25 cloning. The first generation pool of DNA was made double-stranded by Klenow 
polymerase extension of primer D. 

The screening procedure was reiterated five times under the conditions listed in Table 
2 (see Results, infra). DNA substrates from each pooled generation that induced 

30 higher thermodynamic stabilities of the DCMTase.DNA complex were separated from 
lower affinity DNA by PAGE as described above. The region of the gel containing 
shifted DNA complexes was excised and five exchanges of 5 mL water over 72 hours 
shaking on ice was sufficient to elute greater than 95% of all cpm present in the 
excised gel slice as determined by Cerenkov counting. The eluted DNA was 

35 lyophilized, resuspended in TE ( 1 0 mM Tris-HCl at pH 8.0; 1 mM EDTA) and 
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cleaned by one phenohchloroform and two chloroform extractions followed by 
ethanol precipitation and resuspension in TE. The selected DNA pools were amplified 
using 20 rounds of PGR using Deep Vent polymerase (New England Biolabs) and the 
DNA primers shown in Figure 6. The 66 base pair DNA was separated from the PGR 
5 primers on agarose gels and purified using minor changes to the original procedure 
(Wieslander, L., 1979, A simple method to recover intact high molecular weight RNA 
and DNA after electrophoretic separation in low gelling temperature agarose gels, 
Anal. Biochem. 98:305-3). 

10 Identification of preferred DNA substrates 

Individual members from the selected DNA pools were identified by cleaving the 
DNA ends with BamRl and EcdRi endonuclease and cloning into pGEMl Izf- 
(Promega, Madison, WI) using standard protocols. The plasmid DNA from single 
isolates was prepared and the selected CpG flanking sequences were determined using 

15 the CircumVent sequencing kit (New England Biolabs, Beverly, Massachusetts). The 
selected inserts were sequenced from both strands using the T7 and SP6 sequencing 
primers (Promega, Madison, WI). Statistical analyses were performed using several 
programs in the Wisconsin Sequence Analysis Package (Genetics Computer Group, 
Madison, WI) and Kaliedagraph (Synergy Software, Reading, PA). Statistical 

20 signficance was determined by the Student's t-Test using Microsoft Excel, Microsoft, 
Redmond, WA. 

The selected generations were analyzed for initial velocity. The 50 mL reactions 
contained 50 nM DCMTase, 7 mM AdoMet and DNA at 4.7, 23, 47 and 230 nM in 
25 1 00 mM Tris pH 8.0, 1 0 mM EDTA, 1 0 mM DTT, 200 mg/mL BSA. AdoMet (5"- 

adenosyl-L-[methyl- H]methionine ([methyi-^H]AdoMet) (75 Ci/mmol, ImCi/ml, 1 
Ci = 37 Gbq) was purchased fi-om Amersham (Arlington Heights, IL). The 
incubations were for 1 hour at 37 °C. 

30 DNA with tritiated C-5 cytosines, deposited by the DCMTase, were separated from 
the tritiated AdoMet by spotting the reaction on DE 81 filters (Whatman, Lexington, 
MA) followed by a series of 200 mL washes; three in 50 mM HK2PO4 and one each in 
80% ethanol, 95% ethanol and ethyl ether. Dried filters were placed in 3 mL of 
LiquiScint (National Diagnostics, Atlanta, GA) and counted in a scintillation counter. 

35 Counts per minute were transformed to femtomoles of methyl groups deposited on 
DNA over the course of the reaction. 
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Results 

Gel mobility shift analyses ofGC-box and CRE cis- elements 

The preliminary experiments used a standard gel mobility shift assay in which a 
5 constant, low DNA concentration was titrated with higher protein concentrations. As 
shown in Figure 2, essentially all of the GC-box a/b (100 pM) binding occurred 
between 5 nM and 95 nM DCMTase. An initial complex was formed at the lower 
DCMTase concentrations and an abrupt shift of most of the free DNA is coincident 
with the formation of a second complex at about 20 nM DCMTase. Further addition 
10 of DCMTase resulted in the loss of the more mobile complex I in favor of a less 

mobile complex II. Similar results were observed for GC-box a/b^ET^ ^Jh, and 
CRE a^ET/i, Tjig complexes shown in Figure 2 contained the DCMTase, as addition 
of an antibody to DCMTase resulted in an increased shift of each complex. 
Coincubation of DCMTase and GC-box a/b with a 40-fold excess of unlabeled 
15 polydA:polydT, calculated on a dinucleotide basis, did not disrupt the specific 
DCMTaserDNA complex. 

The multiple banding of DCMTase:DNA complexes observed in Figure 2 are similar 
to results obtained with two cytosine DNA methyltransferases, M..Mspl (Dubey, A.K. 

20 & Roberts, R.J., 1992, Sequence-specific DNA binding by the Mspl DNA 

methyltransferase. Nucleic Acids Res. 20:3167-3173) and M.Hhal (Mi, S. & Roberts, 
R.J., 1993, The DNA binding affinity ofHhal methylase is increased by a single 
amino acid substitution in the catalytic center. Nucleic Acids Res. 21 :2459-2464; 
Reaie et al., 1995, DNA binding and methyl transfer catalyzed by mouse DNA 

25 methyltransferase, Biochem. J. 3 12:855-861) obtained similar gel shift results with a 
mammalian DCMTase and assumed that the slower migrating band contained two 
DCMTase molecules bound to a single DNA. 

Steady-state kinetic analyses of the DCMTase with the same 30 base-pair DNA 
30 substrates used in these studies indicate that Km^^^ is 1000- to 50,000-fold higher 
(Flynn, J., et al., 1996, Murine DNA cytosine-C5 methyltransferase: Pre-steady- and 
steady-state kinetic analyses with regulatory DNA sequences, Biochemistry 35:7308— 
73 1 5) than the DNA concentrations used to generate Figure 2 and used by Reale et al, 
1995, supra. The complexes formed in Figure 2 xmder limiting DNA and excess 
35 protein do not promote a detectable catalytic activity. Therefore, the stability of 
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protein:DNA was determined by keeping the enzyme at a constant concentration (1 00 
nM) and varying the amount of added DNA (Dubey & Roberts, 1992, supra). Figures 
3 through 5 show the results with this approach for GC-box a/b'*^^, GC-box a/b and 
GC-box b . In all cases a single shifted band is resolved, the binding isotherm data 

5 are fit well by a simple hyperbola, and each complex is saturable. The apparent Kd^^"^ 
estimations for the different forms of GC-box and CRE are summarized in Table 1 . 
The formation of equimolar proteiniDNA complexes is supported by comparisons of 
the band intensity for complexed DNA at saturation with the band intensities of 
control reactions containing 50, 100 and 150 nM DNA and no enzyme (Figures 3 and 

10 4, lanes 1,2 and 3). 



Table 1: Determinations of Kd'*'^^ for DCMTase by Gel Mobility Shift Assay." 



DNA Substrate Kd^'^^ (mM) 

GC-box a 1.2+/- 0.2 

GC-box b 1.3+/- 0.4 

GC-box bMET o.88 +/- 0. 1 3 

GC-box a/b 0.42+/- 0.15 

GC-box a^MET o.36 +/- 0. 1 1 



CRE a >50 

CRE b >50 

CRE a/b 1.5+/- 0.4 

CREaMET/b i q +/- 0.3 



^ The values presented were obtained from the relative intensities of bands 
corresponding to DCMTase:DNA complexes fit by non— linear regression as described 
in Experimental Procedures. 

30 

DCMTase:DNA complexes formed by high substrate concentrations travel with the 
same relative mobility as complex I in Figure 2. For DNA concentrations higher than 
about 10 times the apparent Kd™'^, the complexes become less mobile. The complex 
formed between DCMTase and single-stranded GC-box b (Figure 5, lane 7) is 
35 shown to migrate to approximately the same distance as the complex formed between 
DCMTase and hemi-methylated CRE a'^^^/b DNA (lane 9). 
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The estimates of apparent Kd™^ are consistent with our previous Km°^^ estimates 
(Table 1 and Flynn, J., et al., 1996, Murine DNA cytosine-C5 methyltransferase: Pre- 
steady- and steady-state kinetic analyses with regulatory DNA sequences, 

5 Biochemistry 35 :7308-73 1 5). The hemi-methylated double-stranded form of DNA 
was bound with a slightly higher affinity than theunmethylated double-stranded form 
for both the GC-box and CRE DNA. In support of CpG flanking sequence 
discrimination by DCMTase, the GC-box substrates of each duplex form had an 
approximate three-fold lower Kd"^"^^ than the corresponding CRE DNA form. Single- 

10 stranded substrates bound with less stability than double-stranded DNA. The binding 
of CRE single-strands was exceptionally poor and at the limits of resolution by this 
technique. GMSA was capable of resolving a binding discrimination in favor of 
guanosine/cytosine-rich sequences flanking a central CpG dideoxy nucleotide. 

1 5 Screening for DCMTase binding discrimination with a randomized DNA pool 

The sampling of several discrete sequences for binding specificity is laborious and 
prone to investigative prejudice. In order to understand the thermodynamic stability of 
DCMTase:DNA interactions in a diverse population of CpG sequence contexts, as 
might be expected in vivo, we devised an in vitro screening protocol that exploits the 

20 gel mobility shift assay. The reaction conditions used for each iterative generation of 
the screening are summarized in Table 2. The first round of screening contained ten 
times more DNA molecules than the maximal population complexity of 2.8 x lO" 
discrete sequences. An increasing fraction of the added randomized pool was shifted 
through the first three generations, during which the enzyme concentration was kept 

25 constant and the DNA concentration was decreased. The initial conditions were 
sufficient to stabilize binding of the DNA pool, so the selective pressure to 
discriminate between sequences was increased in generation-4 and 5 by decreasing 
both enzyme and DNA concentrations. The maximal population complexity in each 
generation decreases because only a fraction of the added DNA was shifted. The 

30 complexity of the starting population is divided by the percentage of DNA shifted in 
each generation and ultimately results in no more than 1.2 x 10"* discrete sequences in 
the generation-5 pool (Table 2). 
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Table 2: Binding conditions and gel shift results of in vitro screening" 



Maximal 

Iterative DCMTase DNA Percentage Population 

5 Generation Concentration Concentration *DNA Shifted Complexity 

0 2.8xl0ll 

1 68 nM 50 nM 1.5% 4.2x1 09 

2 68 nM 25 nM 12% 5.0x1 0^ 
10 3 68 nM 12.5 nM 25% 1.2x10^ 

4 5.0 nM 0.125 nM <1% 1.2x10^ 

5 0.50 nM 0.030 nM <1% 1.2x10^ 



" Listed are the enzyme and DNA substrate concentrations used in each round of 
15 selection. The Cerenkov cpm within the excised gel slice, containing the shifted 
complex, is shown as a percentage of the total Cerenkov counts loaded onto the gel. 
This percentage limits the complexity of the DNA pool, therefore it is used to 
calculate the maximal population complexity in each successive generation. 

20 Individual members from the starting pool and generations— 1, 3 and 5 were cloned 
and sequenced from both strands. Only the guanine containing strands are shown for 
simplicity in Figure 7, however, these studies were done using urmiethylated double- 
stranded substrates. Synthesis of the starting population is shown to be randomized at 
each position with the expected frequency approximating 1/3 each in guanine, adenine 

25 and thymine. 

The selected pools successively became more guanosine— rich ^^nth each generation. A 
total of 49 isolates were cloned and sequenced from the generation-5 pool and none 
were identical. Nucleotide, dinucleotide and trinucleotide frequencies were analyzed 

30 using the COMPOSITION (Wisconsin Sequence Analysis, Madison, WI). The 

selected nucleotides flanking the central CpG dinucleotide were 64.7% in guanine, 
13.8% in adenine and 21.6% in thymine. The mean frequency of guanine bases per 
generation-5 isolate was 14.5 out of the 24 selectable positions and more guanines 
were observed on the 5 '-flank compared to the 3 '-flank (p = 0.04). The far flanking 

35 regions are a full helical turn distal to the invariant CpG and are highly enriched in 
guanine as compared to regions proximal to the CpG. In addition to the abundance of 
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guanosyl-guanosyl (GpG) dinucleotides, guanosyl-thymidyl (GpT) and thymidyl- 
guanosyl (TpG) dinucleotides appear often and occur more frequently on the 3 '-flank 
(p = 0.01). Trinucleotide analyses reinforce the observations at the nucleotide and 
dinucleotide levels. The highest frequency of GpGpG was at the far 5 '-flank, while 

5 GpTpG and TpGpT trinucleotides were far more abundant in the 3' flank (p = 0.01). 
The discrimination exhibited by DCMTase for generation-5 sequences may reflect an 
important structural characteristic that contributes to stabilization of the initial 
DCMTaseiDNA complex. These analyses suggest that an ideal substrate has sequence 
assymmetry around the CpG and that there is a particular binding orientation of 

10 DCMTase on DNA. 

The guanine-richness at each randomized position for the generation-5 isolates is 
best shown in Figure 8. The murine DCMTase is a large 183,000 Da protein that 
selected for sequences extending over the entire 12 base— pairs provided for selection 

15 on each side of the central CpG. The Wisconsin Sequence Analysis program 
CONSENSUS was used to construct a common generation-5 sequence with a 
certainty level of 60%. The sequence GGGGGGGRRKKGCGKGGKGKKGKKGG 
(SEQ ID NO:l), where R is guanine or adenine and K is guanine or thymine, was 
obtained and is shown to highlight the guanine richness and the preference for GpT 

20 and TpG on the 3'— side of the CpG. At a certainty level of 80% the plasticity of 
sequence preferences can be seen close to the invariant CpG; 

KGGRKKRDDDKRmKRIODKKKKKKKG (SEQ ID NO:2) (D is guanine, thymine 
or adenine). We have not tested whether the DCMTase can select for sequences out 
further than 12 base-pairs or if multiple CpG dinucleotides are preferred over the 26 
25 base-pair expanse. 

Similar sequences occur frequently in the genome 

We subjected the 49 generation— 5 sequences to FASTA searches of the GenBank 
library to see if similar sequences exist in the genomes of higher eukaryotes. The 
30 search was limited in three ways. First, only the mouse and human sequences were 
searched, even though DCMTase activities have been identified in many metazoan 
organisms. Second, to be considered fiirther, a "hit" had to be identical at 22 of the 26 
base positions, including the central CpG. No hits were retrieved that had a higher 
identity. Third, no gaps in alignment were allowed. 

35 
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Remarkably, 20 "hits" were recovered from GenBank that met these severely 
restricted criteria. Figxxre 9 shows the alignments of the five hits from mouse and lists 
the 15 hits from htunan. A simplified, random genome would be expected have a 
complexity of 4^^, or 1 .8 x 10^^ base-pairs, in order to contain any of these sequences 

5 just once. Of course, this is an oversimplification. But, the results appear to be striking 
when considering the mammalian genome is approximately 3x10^ base-pairs, only 
about 40% in guanine plus cytosine, and about 10-fold deficient in CpG 
dinucleotides. The majority of hits are in what may be presumed to be regulatory 
regions of the genome; 5' or 3' untranslated regions (UTR) or in CpG islands. Many 

1 0 of the associated genes are also of developmental interest. For example, homeo box 
Hox2.6 and HoxA7 function in early body segmentation. These findings may reflect 
an intrinsic function of DCMTase in developmental programming. 

Control experiments eliminate a non-specific selection 

15 A control series of amplifications in the absence of DCMTase were done to show that 
our iterative PGR conditions were not responsible for the guanine selection observed 
with the generation-5 DNA. Endonuclease challenge was done with Taq I (5'- 
TCGA-3' restriction) and Aci I (5'-GCGG-3' restriction) to assess the randomness 
of mock selected and DCMTase selected pools. Although this is a limited sampling, 

20 the DNA specificity of these enzymes can discern the relative abundance of 
nucleotides immediately flanking the CpG, The guanine-richness is probed by Aci I 
and the adenine/thymine-richness is probed by Taq I. After endonuclease challenge of 
■'■^P labeled DNA, the products were resolved on a 12% polyacrylamide gel. Using 
densitometry, the intensities of the restricted bands were compared to unrestricted 

25 control bands. The mock generation-5 sequences immediately flanking the CpG 
remained random, approximately 6% of the DNA was restricted by each 
endonuclease, demonstrating that a non-specific selection did not occur under these 
experimental conditions. Results with the DCMTase selected generations were 
consistent with the guanine/cytosine selection determined firom sequencing individual 

30 clones. Also, consistent with a lack of selection fi-om the protocols alone, the entire 
mock selected pool was sequenced and compared to the DCMTase selected 
generation— 5 pool, similar to that done by Blackwell et al., 1991. An equal abundance 
of the randomized nucleotides was resolved for the mock selected pool and a 
guanine-rich population was resolved for the DCMTase selection. 
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The DCMTase-selected DNA from the iterative generations were compared to each 
other in binding and catalytic assays. The DCMTase binds the pooled generation-5 
sequences only two-fold more tightly than the starting pool. The inherent complexity 
of each pool makes it difficult to assess the true preference for each generation as a 

5 whole. The question of sequence specificity was more accurately addressed by GMSA 
of the discrete sequences, CRE a/b and GC— box a/b. There we found that the 
guanine/cytosine-rich GC-box was preferred approximately 3-fold compared to the 
more adenine/thymine-rich CRE sequence. Figure 10 shows the initial velocity plots 
for the starting population and generations-2, 4 and 5. The catalj^ic specificity for the 

1 0 selected generations increases at each cycle, with little change in Km"^'^ and a two- 
fold increase in kcat- 

Discussion 

Because it is the catalytic agent for c3ftosine methylation, DCMTase clearly has a 

15 central role in both maintaining DNA methylation patterns and in establishing new 
"epi-genotypes". The fundamental issues of binding and catalytic discrimination of 
the mammalian enzyme for different DNA sequences have been actively debated. 
Many reports have suggested that the ability of the enzyme to methylate the cognate 
CpG dinucleotide depends to some degree on flanking sequences (Bolden, A.H., et al., 

20 1986, Primary DNA sequence determines sites of maintenance and de novo 
methylation by mammalian DNA methyltransferases, Mol. Cell. Bio. 6:1135-1140; 
Bestor, T.H., et al., 1992, CpG islands in mammalian gene promoters are inherently 
resistant to de novo methylation, GATA 9:48-53; Hepbum, P.A., et al., 1991, 
Enzymatic methylation of cytosine in DNA is prevented by adjacent O^- 

25 methylguanine residues, J. Biol. Chem. 266:7985-7987; Pfeifer, G.P., et al., 1985, 
Mouse DNA-cytosine-5-methyltransferase: sequence specificity of the methylation 
reaction and electron microscopy of enzyme-DNA complexes, EMBO J. 4:2879- 
2884; Ward, C, et al, 1987, In vitro methylation of the 5 '-flanking regions of the 
mouse b-globin gene, J. Biol. Chem. 262:11057-1106; Carotti, D., et al, 1986, 

30 Substrate preferences of the human placental DNA methyltransferase investigated 
with synthetic polydeoxynucleotides, Biochim. et Biophys. Acta 866:135-143; Smith, 
S.S., et al., 1992, Mechanism of human methyl-directed DNA methyltransferase and 
the fidelity of cytosine methylation, Proc. Natl. Acad. Sci. USA 89:4744-4748), while 
others describe the lack of any flanking sequence effects (Bestor, T.H. and Tycko, B., 

35 1996, Creation of methylation pattems, Nature Genetics 12:363-367; Carlson, L., et 
al., 1992, Properties and localization of DNA methyltransferase in preimplantation 
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embryos: implications for genomic imprinting, Genes and Development 6:2536- 
2541). These studies used partially purified or proteolyzed enzyme, substrates 
containing multiple CpG sites, and compared relative velocities obtained at a single 
substrate DNA concentration, thereby precluding an accurate estimation of specificity 
5 (otherwise known as discrimination). 

Similarly, reports regarding the preference of DCMTase for single- and double- 
stranded substrates are also in direct conflict with one another (Adams, R.L.P., et al., 
1986, Mouse ascites DNA methyltransferase: characteristic of size, proteolytic 

10 breakdovra and nucleotide recognition, Biochim. Et Biophys. Acta 868:9-1 6; Smith et 
al., 1992, supra; Carotti et al., 1986, supra; Wang, R.Y.H., et al., 1984, Human 
placental DNA methyltransferase: DNA substrate and DNA binding specificity, Nucl. 
Acids Res. 12:3473-3490; Pfeifer et al, 1985, supra; Gruenbaum, Y., et al., 1982, 
Substrate and sequence specificity of a eukaryotic DNA methylase. Nature 295:620- 

15 622; Christman, J.K., et al., 1995, 5-Methyl-2'-deoxycytidine in single-stranded 
DNA can act in cis to signal de novo DNA methylation. Proc. Natl. Acad. Sci. USA 
92:7347-7351). 

A recent steady— state kinetic analysis with uimiethylated GC— box and CRE DNA 
20 sequences showed compensatory 3- to 4-fold changes in Km^^"^ and kcat that resulted 
in a small discrimination at the level of kcat/Km'^^^ (Flynn, J., et al., 1996, Murine 
DNA cytosine-C5 methyltransferase: Pre-steady- and steady-state kinetic analyses 
with regulatory DNA sequences, Biochemistry 35:7308-7315). In this Example, the 
sequence— dependent discrimination of DCMTase is quantitatively addressed at the 
25 level of Kd°^^. The thermodynamic binding constant, Kd^^"^, is a characteristic of the 
initial enzyme:DNA complex and Km'^^'^ has an additional term accounting for the 
forward reaction rate. DCMTase:DNA interactions were investigated with discrete 
DNA sequences of biological importance, and v^th a large divergent pool of DNA 
sequences. The discrimination between unmethylated single- and double-stranded 
30 DNA, and unmethylated and hemi-methylated double-stranded DNA was also 
quantified. 

DCMTase binding to DNA is stabilized by guanine/cytosine—rich sequences 

Gel mobility shift assays were used to determine the apparent dissociation constants, 
35 Kd^'^'^, of the enzyme for different forms of the GC-box and CRE cw-regulatory 
elements. Complex, higher-order interactions were observed under the more standard 
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conditions of limiting DNA and varying protein concentrations. While the multiple 
protein:DNA complexes and unusual DNA concentration dependence are shown to 
involve the DCMTase, accurate quantitative analysis is precluded due to the 
uncertainty of binding stoichiometry and the relative affinities of each binding event 

5 (Senear, D.F., «fe Brenowitz, M., 1991, Determination of binding constants for 
cooperative site-specific protein-DNA interactions using the gel mobility shift assay, 
J. Biol. Chem. 266:13661-13671; Sackett, DX. & Saroff, H.A., 1996, The multiple 
origins of cooperativity in binding to multi-site lattices, FEES 397:1-6). Whereas 
many DNA-binding proteins, including DNA adenine-N^ methyltransferases (Reich, 

10 N.O. & Mashhoon, N., 1990, Inhibition of EcoRl DNA methylase with cofactor 
analogs, J. Biol. Chem. 265:8966-8970), form a single protein:DNA complex under 
similar conditions, bacterial and mammalian DNA cytosine methyltransferases are 
known to produce multiple complexes at low DNA concentrations (Dubey, A.K. & 
Roberts, R.J., 1992, Sequence-specific DNA binding by the Mspl DNA 

15 methyltransferase. Nucleic Acids Res. 20:3167-3173; Mi, S. & Roberts, R.J., 1993, 
The DNA binding affinity of Hhal methylase is increased by a single amino acid 
substitution in the catalytic center. Nucleic Acids Res. 21:2459-2464; Reale, A., et 
al., 1995, DNA binding and methyl transfer catalyzed by mouse DNA 
methyltransferase, J. Biochem. 312:855-861). The multiple complexes formed with 

20 excess enzyme and DNA concentrations far below Km™"^ may be common to cytosine 
DNA methyltransferases. These complexes are known to be catalytically incompetent 
in the case of the murine enzyme (Flynn, J., et al, 1996, Murine DNA cytosine-C5 
methyltransferase: Pre-steady- and steady-state kinetic analyses with regulatory 
DNA sequences. Biochemistry 35:7308-7315). 

25 

Gel mobility shift assays performed with micromolar DNA concentrations and 
limiting DCMTase resuh in a single, shifted DNA band. These observations are again 
similar to those described for the bacterial cytosine DNA methyltransferases, M.Mspl 
(Dubey & Roberts, 1992, supra) and M.Hhal (Mi & Roberts, 1993, supra); the 

30 determination of equilibrium constants under these conditions is valid and not. In fact, 
our enzyme preparation obeyed classical Michaelis-Menton kinetics with the same 
substrates when assayed in the same DNA concentration range (Flynn et al., 1996, 
supra). Also, the estimated Kd°^^ values reported in Table 1 are similar to those 
previously reported at the level of Km™^ with the same DNA (Flynn et al, 1996, 

35 supra). The Kd^'^'^ values are about one-half of those determined at the level of 

Km^^'^ for the same double-stranded substrates. The lack of large differences between 
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these constants suggests that steps following the initial formation of a specific 
protein: DNA complex do not contribute largely to Km™"^. 

DCMTase bound DNA in a 1:1 stoichiometry and had a strong preference for binding 
5 double-stranded DNA over single-stranded DNA. Hemi-methylated DNA was 

bound by the enzyme with slightly higher affinity thanunmethylated double-stranded 
DNA. The Kd^^"^ data fiirther supports the interpretation that the preference for hemi- 
methylated DNA versus unmethylated double-stranded DNA derives almost entirely 
from changes in the methylation rate constant, kmethyiation (Flynn, J., et al., 1996, 

10 Murine DNA cytosine-C5 methyltransferase: Pre-steady- and steady-state kinetic 
analyses with regulatory DNA sequences. Biochemistry 35:7308-7315). A recent 
study of M.//7iaI:hemi-methylated DNA and M.///zaI:DNA cocrystal structures 
attempted to rationalize the two to three-fold discrimination manifested by this 
enzyme at the level of binding (O'Gara M., et al., 1996, A structural basis for the 

15 preferential binding of hemimethylated DNA by Hhal DNA methyltransferase, J. Mol. 
Bio. 263:597-606). These authors proposed that the binding discrimination derives 
mostly from a single van der Waals' contact between the Glu"^^^ carboxylate and the 
methyl group of the 5-methyl-2'deoxycytidine. While the DCMTase also has a 
glutamate at this position (Glu^^^*), we suggest that other differences in the assembly 

20 of the active site contribute to the quantitatively larger preference for hemi- 
methylated DNA shown by the murine enzyme. 

The two base-pair, CpG . cognate sequence of the mammalian DCMTase is small 
compared to the cognate sites of most bacterial DNA methyltransferases. DNA 

25 footprint analyses ofMSssI, M.Hhal and M.Mspl are consistent with protein:DNA 
interactions extending over 16 base-pairs (Renbaum, P. & Razin, A., 1995, Footprint 
analysis of M.SssI and M.Hhal methyltransferases reveals extensive interactions with 
the substrate DNA backbone J. Mol. Biol. 248:19-26; Dubey, A.K. & Roberts, R.J., 
1992, Sequence-specific DNA binding by theMspI DNA methyltransferase. Nucleic 

30 Acids Res. 20:3 1 67-3 1 73). Thus, the large mammalian DCMTase protein (Glickman, 
J.F. & Reich, N.O., 1997, Peptide mapping of the murine DNA methyltransferase 
reveals a major phosphorylation site and the start of translation, J. Biol. Chera. In 
press) most likely involves DNA contacts outside of this minimal sequence. Support 
for this is provided by the observation that the guanine/cytosine-rich GC-box element 

35 (GGGGCGGGGC (SEQ ID N0:3)) is bound approximately 3-fold more tightly than 
the adenine/thymine-rich CRE element (TGACGTCA). An in vitro selection method 
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was designed to define both the span of the protein:DNA interface, and the sequence 
preference of the enzyme for nucleotides flanking the consensus CpG. Previous 
applications of this strategy were useful in defining a consensus sequence for DNA 
binding proteins involving large differences in binding energetics between random 

5 and target sequences (Kinzler, K.W. & Vogeistein, B., 1989, Whole genome PGR: 
application to the identification of sequences bound by regulatory proteins, Nucl. 
Acids Res. 17:3645-3653; Thiesen, H. & Bach, C, 1990, Target detection assay 
(TDA): A versatile procedure to determine DNA binding sites as demonstrated on 
SPl protein, Nucl. Acids Res. 18:3203-3209; Blackwell, T.K., et al., 1990, 

10 Sequence-specific binding by the c-Myc protein, Mol. Cell. Bio. 13:5216-5224; He, 
Y., Stockley, P.O. & Gold, L., 1996, In vitro evolution of the DNA binding sites of 
Escherichia coli methionine repressor, MetJ. J. Mol. BioL 255:55-66). These 
selection strategies were extended to identify flanking sequence preferences, where 
binding discrimination is expected to be much less than when searching for a six to 

15 ten base— pair cognate site. One potential outcome would be the lack of any 

preference, as described for the UBF protein using this method (Copenhaver, G.P., et 
al., 1994, The RNA polymerase I transcription factor UBF is a sequence-tolerant 
HMG-box protein that can recognize structured nucleic acids, Nucleic Acids 
Research 22:2651-2657). A consensus sequence larger than the minimal CpG was not 

20 likely to result from this selection process, because genomic sequencing of 5-'"C 
reveals that the enzyme methylates many CpG contexts in vivo. 

The screening method employed herein efficiently identified aDCMTase-induced 
population drift from 33.3% guanosine in the starting randomization to 50.0% in 

25 generation-!, 55.3% in generation-3 and finally 64.7% in generation-5. Randomized 
position 12 (see Figure 8) was enriched to 88% guanine in generation-5, suggesting 
that the total sequence space represented by the starting randomization was severely 
confined. Ultimately, the selection process did not disclose an obvious preferred 
sequence, but clearly a selection was evident. This is consistent with the observation 

30 that roughly 3x10^ CpG flanking sequence contexts in the murine genome undergo 
methylation in vivo. 

Sequence analysis of the 49 generation-5 members provided evidence that the 
DCMTase may bind these substrates in a preferred orientation. A greater guanosine 
35 selectivity was associated with the far 5 '-side of the CpG and a more divergent region 
was exposed from the -2 to the -5 positions. The 3 '-side of the invariant CpG 
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exhibits a different DCMTase preference; GpT and TpG dinucleotides occur more 
frequently and are often tandomly arranged. Empirically, the data do not allow for 
prediction of which strand may be poised to be methylated. The results with the 
mammalian DCMTase, which suggest sequence-dependent binding affects for a 26 

5 base— pair expanse (or more), are quite reasonable given the DNA footprinting results 
for the bacterial en2ymes mentioned. The binding asymmetry suggested by the results 
herein was likely induced by the design of the starting population, because one strand 
was guanine-rich while the other was cytosine-rich. This design was chosen in order 
to avoid introducing multiple CpG dinucleotides that could complicate the assessment 

10 of flanking sequence contributions around a single CpG. 

DCMTase interactions with DNA are influenced by helical geometries 

Dinucleotide analysis has been useful for understanding sequence-dependent 
conformational parameters of DNA (El Hassan, M.A. and Cailadine, C.R., 1996, 

15 Propeller-twisting of base-pairs and the conformational mobility of dinucleotide 
steps in DNA, J. Mol. Biol. 259:95-103; Hunter, C.A., 1993, Sequence-dependent 
DNA structure. The role of base stacking interactions, J. Mol. Biol. 230:1025-1054; 
Yanagi, K., et al., 1991, Analysis of local helix geometry in three B-DNA decamers 
and eight dodecamers, J. Mol. Bio. 217:201-214). The crystallography-derived 

20 parameters are generally similar for the protein bound and free states (Cailadine, C.R. 
and Drew, H.R., 1996, A useful role for "static" models in elucidating the behaviour 
of DNA in solution, J. Mol. Biol. 257:479-485). Dinucleotide conformational 
parameters have a limited range which are dependent on the two nucleotides 
immediately flanking the dinucleotide step in question. Because there are 136 four- 

25 base steps, the understanding of the sequence-dependent helix geometry at this level 
is still incomplete (El Hassan & Cailadine, 1996, supra; Yanagi et aL, 1991, supra). 
More distant nucleotides also have significant effects on CpG helical parameters 
(Lefebvre, A., et al., 1996, Solution structure of the CpG containing d(CTTCGAAG)2 
oligonucleotide: NMR data and energy calculations are compatible with a BI/BII 

30 equilibrium at CpG. Biochemistry 35:12560-12569). 

These analyses provide the basis for a qualitative interpretation of DNA 
conformational features important for the stabilization of the initial DCMTase:DNA 
complex. Guanosine-rich stretches, best represented in this Example by the GC-box 
35 and the selected 5 '-regions, often assume an A-DNA conformation. Guanosine-rich 
helices are under-wound because neighboring guanine bases tend to overlap and lead 
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to low dinucleotide twist angles (McCall, M., et al., 1985, The crystal structure of 
d(G-G-G-G-C-C-C-C). A model for poly (dG):poly(dC). J. Mol. Biol. 183:385- 
396; Yanagi et al, 1991, supra; El Hassan & Calladine, 1996, supra). Also, base-pair 
slide is allowed more freely in GpG than other steps, due mainly to a low propeiler- 
5 twist parameter. A-DNA thus differs from B-DNA in that the minor groove is wide 
and shallow while the major groove is narrower and deeper. While little is known 
about the DCMTase:DNA interface, the enzyme contains the peptide motif 
^^^SPKK^^^, which is found in proteins known to interact with the minor groove of 
DNA (Churchill, M. & Suzuki, M, 1989, "SPKK" motifs prefer to bind to DNA at 

10 A/T-rich sites, EMBO J. 8:4189-4195). The preference for sequences which have A- 
DNA like features may be due to DCMTase:DNA interactions mediated by this motif 
at some distance from the protein elements involved in CpG recognition. The GpT 
and TpG dinucleotide repeats, observed more frequently in the DCMTase selected 3 '- 
flank, have unique sets of conformational parameters that can increase helical 

1 5 flexibility (Nagaich, A.K., et al., 1 994, C A/TG sequence at the 5 ' end of oligo(A)- 
tracts strongly modulates DNA curvature, J. Biol. Chem. 269:7824-7833; Beutel, 
B.A. & Gold, L., 1992, In vitro evolution of intrinsically bent DNA, J. Mol. Biol. 
228:803-812; Lyubchenko, Y.L., et al., 1993, CA runs increase DNA flexibility in the 
complex of 1 Cro Protein with the Or3 site. Biochemistry 32:4121-4127; Haniford, 

20 D.B & Pulleybank, D.E., 1 983, Facile transition of poly [d(TG):d(CA)] into a left- 
handed helix in physiological conditions. Nature 302:632-634). 

Like the TpG step, CpG is considered "malleable" because the local conformations 
are dependent on flanking base-pairs (Lefebvre, A., et al., 1996, Biochemistry 

25 3 5:12560-12569; Lefebvre, A. Maxiffet, O., Hartmann, B., Lescot, E. & Fermandjian, 
S., 1995, Biochemistry 34:12019-12028; Hunter, C.A., 1993, J. Mol. Biol. 230:1025- 
1054; Prive, G.G., et al., 1991, J. Mol. Biol. 217:177-199; Grzeskowiak, K., et al., 
1991, J. Biol. Chem. 266:8861-8883). Severe effects on the geometrical parameters 
associated with a centrally located CpG have been measured for at least 15 different 

30 sequences. The structures of two oligonucleotides containing the consensus CRE 

element, TGACGTCA, have been determined (Mauffet, O., et al., 1992, J. Mol. Biol. 
227:852-875; Konig, P. & Richmond, T.J., 1993, J. Mol. Biol. 233:139-154). Several 
sequences closely related to the GC-box consensus, GGGGCGGGGC (SEQ ID 
N0:3), have also been crystallized. 
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A small twist angle is characteristic of CpG embedded in guanineycytosine-rich 
sequences and likely adds to the overall A-DNA character (Haran, T.E., et al., 1 987, 
The crystal structure of d(CCCCGGGG): A new A-form variant with an extended 
backbone conformation, J. Biomol. Struct. Dynam. 5:199-217; Heinemann, U., et al., 
5 1 987, Crystal structure analysis of an A-DNA fragment at 1 .8A resolution: 
d(GCCCGGGC), Nucl. Acids Res. 15:9531-9549; Rabinovich, D., et al., 1988, 
Structures of the mismatched duplex d(G-G-G-T-G-C-C-C) and one of its 
Watson-Crick analogues d(G-G-G-C-G-C-C-C), J. Mol. Biol. 200:151-161; 
Verdaguer, N., et al., 1991, Molecular structure of a complete turn of A-DNA, J. Mol. 

10 Biol. 221 :623-635; Frederick, C.A, et al., 1989, Molecular structure of an A-DNA 
decamer d(ACCGGCCGGT), Eur. J. Biochem. 181:295-307; Conner, B.N., et al., 
1984, Helix geometry and hydration in an A-DNA tetramer: CCGG. J. Mol. Biol. 
174:663-695; McCall etal., 1985, The crystal structure of d(G-G-G-G-C-C-C-C). 
A model for poly(dG):poly(dC), J. Mol. Biol. 183:385-396). Conversely, 

15 adenine/thymine-rich flanking sequences can lead to negative roll and high twist 
values at the CpG, so that the helix conforms more to B-DNA (Lefebvre, A., et aL, 
1996, Solution structure of the CpG containing d(CTTCGAAG)2 oligonucleotide: 
NMR data and energy calculations are compatible with a BI/BII equilibrium at CpG, 
Biochemistry 35:12560-12569; Mauffet, O., et al., 1992, The fine structure of two 

20 dodecamers containing the cAMP responsive element sequence and its inverse, J. 
Mol. Biol. 227:852-875; Grzeskowiak, K., et al., 1991, The structure of B-helical 
CGATCGATCG (SEQ ID N0:6) and comparison with CCAACGTTGG (SEQ ID 
N0:4), J. Biol. Chem. 266:8861-8883; Prive, G.G., et al., 1991, Structure of the B- 
DNA decamer C-C-A-A-C-G-T-T-G-G (SEQ ID NO:7) and comparison with 

25 isomorphous decamers C-C-A-A-G-A-T-T-G-G (SEQ ID NO:5) and C-C-A- 
G-G-C-T-G-G, J. Mol. Biol. 217:177-199; Bingman, C.A., et al., 1992, Crystal 
and molecular structure of the A-DNA dodecamer d(CCGTACGTACGG (SEQ ID 
NO:8)), J. Mol. Biol. 227:738-756). The backbone torsion angles that connect the 
cytidine and guanosine residues in these structures are particularly interesting. The 

30 large slide associated with extensive inter-strand guanine stacking tends to stretch and 
contort the a and g torsion angles into the Bn conformation (Haran, T.E., et al., 1987, 
The crystal structure of d(CCCCGGGG): A new A-form variant with an extended 
backbone conformation, J. Biomol. Struct. Dynam. 5:199-217; Rabinovich et al., 
1988, supra; Lefebrve et al., 1996, supra; El Antri, S., et al., 1993, Structural 

35 deviations at CpG provide a plausible explanation for the high frequency of mutation 
at this site, J. Mol. Biol. 230:373—378). Bn has an unusual trans, trans arrangement of 
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a and g torsion angles that is most often associated with A-DNA. Bn may be more 
readily attained by a CpG with guanine/cytosine-rich flanking sequences than with 
adenine/thymine-rich ones. Mechanically speaking, the Bn conformation allows for a 
crankshaft motion to modulate a destacking of bases (Haran et al, 1987, supra). This 
5 is likely an early event in the base flipping process mediated by DNA 

methyltransferases(Allan, B.A. & Reich, N.O., 1996, Targeted base stacking 
disruption by the EcoRI DNA methyltraasferase. Biochemistry 35:14757-62). 

The functional importance of the CpG phosphate orientation and flexibility, and 

10 DCMTase:phosphate interactions in general, have been studied using the 
M./ftaI:DNA cocrystal structure (Klimasauskas, S., et al., 1994, Hhal 
methyltransferase flips its target base out of the DNA helix. Cell 76:357-369; Cheng 
X; Blumenthal RM., 1996, Finding a basis for flipping bases. Structure 4:639-645). 
This structure has the target cytosine positioned outside of the helical cylinder 

15 covalently trapped by the enzyme. Surprisingly few contacts are made directly with 
the bases and extensive interactions with the backbone are asymmetrically located 
around the extrahelical cytosine. Interactions with the two phosphates on the 5 '-side 
of this cytosine appear to be particularly important (S'-^pG^pC^pG^pC^p-S') and only 
phosphates 2 through 5 show several angstrom displacement when compared to the 

20 uncomplexed DNA. The peptide regions which contact the phosphates are conserved 
among numerous bacterial cytosine DNA methyltransferases (Cheng & Blumenthal, 
1996, supra). For M.Hhal, phosphate ^p is contacted by Arg*^^ and Ser^^, and 
sequence alignment suggests that Arg'^^^ and Ser'^'^"' may play analogous roles in the 
mammalian DCMTase. Also, Arg^^ which contacts ^p and Lys^° which contacts *p in 

25 M.Hhal have homologous residues in the DCMTase, namely Lys^^"*^ and Arg*^^''. 

The murine DCMTase has a DNA binding specificity that is similar to the catalytic 
specificity. The preference of the enzyme for guanine/cytosine-rich sequences may 
reflect a preferred positioning of backbone phosphates within the DCMTase:DNA 

30 complex. DCMTase may use the specificity advantage in localizing to certain 

genomic regions or to preferentially methylate guanine/cytosine-rich DNA in vivo. 
The function of methyiation in bacteria as a primitive immune system, may be a major 
function for the eukaryotic methyltransferases. Many human viruses are very 
guanine/cytosine-rich and the discrimination we identified may aid in the specific 

35 deactivation of infected viral DNA. 



wo 99/12027 PCT/US98/12351 

-39- 

Example 2: Kinetic Mechanism and Identification of a Potent Inhibitor of Murine 
DNA Cvtosine-C^ Methvltransferase 

This example provides four types of steady-state kinetic analyses to identify the order 
of substrate addition to the enzyme and the order of product release. In addition, this 
5 example identifies a potent single-stranded DNA inhibitor of DCMTase. 



Materials 

S'-adenosyl-L-[methyl-3H]methionine {75 Ci/mmol, 1 mCi/ml, 1 Ci=37 GBq) was 
from Amersham Corporation. Unlabeled AdoMet (Sigma Chemical Company, St. 

10 Louis, MO) was further purified as described (Flynn, J., et al., 1 996, Biochemistry 
35:7308-73 15). Routinely, 125 mM AdoMet stocks were prepared at a specific 
activity of 5.8 x 10^ cpm/pmol. Two lots of poly(dI-dC:dI-dC) were purchased from 
Pharmacia Biotech, Inc. (Piscataway, NJ) with an average length of 6250 and 5000 
base pairs. DE81 filters were purchased from Whatman, Inc. Other standard chemicals 

1 5 and reagents were purchased from Sigma Chemical Company or Fisher Scientific 
(Hampton, New Hampshire). 

DNA cytosine C-5 methyltransferase was purified from mouse erythroleukemia cells 
as described (Xu, G., et al, 1995, Purification and Stabilization of mouse DNA 
20 methyltransferase, Biochemi.Biophysi.Res.Communi 207:544-55 1). Two separate 
preparations, with concentrations of 380 nM and 260 nM, were confrrmed to have 
equivalent activities with the subsfrates studied. 

DNA Substrate Preparation 

25 The following three oligonucleotides mimic the GC-box transcriptional cis~ 

regulatory element, in bold type, and were prepared as described (Flynn et aL, 1996, 
supra). The central CpG is underlined. 

GC-box a: 5'-GGGAATTCAAGGGGCGGGGCAAGGATCCAG-3' 
30 (SEQ ID NO:9) 

GC-box b: 5'-CTGGATCCTTGCCCCGCCCCTTGAATTCCC-3' (SEQ 

ID NO: 10) 

GC-box bMET; 5'-CTGGATCCTTGCCC°>CGCCCCTTGAATTCCC-3' 
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Steady- State Kinetic Assays 

Duplicate 25 \xL reaction volumes contained 3.0 nM DCMTase and 10 \iM AdoMet in 
MR buffer (100 mM Tris-HCl, pH 8.0, 10 mM EDTA, 200 [ig/ml BSA, 10 mM 
DTT). After preincubation at room temperature for up to 1 0 minutes, reactions were 
5 initiated by the addition of poly(dI dC:dI dC) and, if indicated, inhibitor DNA or 
reaction products. In several experiments it was found that initiating a reaction 
containing DNA with AdoMet yielded similar results to those routinely used. Single- 
stranded DNA was heated to 90 °C and quick cooled on ice, prior to initiation of the 
reaction. Freeze-thawed DNA produced equivalent results. Incubations were at 37 °C 

10 for 60 minutes. The poly(dI-dC:dI-dC) concentrations were 2.0, 4.0, 8.0, 16, 35, 80, 
160, 250, 400, 700 and 1000 pM. In some experiments, GC-box a/b was the 
substrate, using 100 nM DCMTase and DNA concentrations of 0.20, 0.40, 1.0, 2.0, 
4.0, 8.0, 15, 23 and 35 fxM. The reaction was stopped after 60 minutes by transferring 
20 |iL of the reaction onto a DE 81 filter paper that was processed as described (Flynn 

15 et al., 1996). The radioactivity above the background, determined from assays without 
added poly(dI dC:dI dC), was converted to initial velocities and expressed as 
picomoles of methyl groups transferred to poly(dI-dC:dI-dC) per hour and plotted in 
double reciprocal form. The substrates poly(dI-dC:dI dC) and AdoMet, competitor 
DNA and reaction product concentrations were varied as indicated in the Figures. 

20 

For double reciprocal plots of velocity versus substrate concentration (Figures 12A & 
12B), reactions contained 20 nM DCMTase in 100 mM Tris pH 8.0, 10 mM EDTA, 
10 mM DTT, 200 ng/mL BSA. Incubations were at 37 °C for 60 minutes. For Figure 
12A, poly(dI-dC:dI-dC) was varied at: triangles, 4 p,M; squares, 2 |j,M; diamonds, 
25 1 nM; circles, 0.5 \xM, while AdoMet was constant. For Figure 12B, AdoMet was 

varied at: triangles, 112 pM; squares, 56 pM; diamonds, 28 pM; circles, 14 pM, while 
poiy(dI-dC:dI dC) concentration remained constant. 

For the double reciprocal plot of velocity versus poly(dI-dC:dI-dC) with varymg GC- 
30 box b concentrations (Figure 13), reactions contained 3.0 nM DCMTase and 10 ^iM 
AdoMet m 100 mM Tris pH 8.0, 10 mM EDTA, 10 mM DTT, 200 (ig/mL BSA. The 
poly(dI-dC:dI-dC) concentrations were 10, 13, 20, 40 and 100 pM. The GC-box b 
concentrations were: diamonds, 0; circles, 0.75 j4,M; triangles, 1.5 |j,M; squares, 5.0 
jxM. Incubations were at 37 °C for 60 minutes. 
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For the double reciprocal plot of velocity vs. poly(dI dC:dI-dC) with varying GC-box 
bMET concentrations (Figure 14), reactions contained 2.0 nM DCMTase and 10 jiM 
AdoMet in 100 mM Tris pH 8.0, 10 mM EDTA, 10 mM DTT, 200 p.g/mL BSA. The 
poIy(dI dC:dI dC) concentrations were 1.5, 3.0, 7.5, 15 and 20 pM. The GC-box 
5 b^ET concentrations were: squares, 0; circles, 10 nM; diamonds, 20 nM; triangles, 
40 nM. Incubations were at 37 °C for 60 minutes. 

For the double reciprocal plot of velocity versus AdoMet with varying GC-box b*^^^ 
concentrations, (Figure 15), reactions contained 4.0 nM DCMTase and 50 pM 
10 poly(dI dC:dI dC) m 1 00 mM Tris pH 8.0, 10 mM EDTA, 10 mM DTT, 200 jxg/mL 
BSA. The AdoMet concentrations were 0.75, 1.5, 3.0 and 6.0 ^M. The GC-box b'^'^^ 
concentrations were: squares, 0; circles, 20 nM; diamonds, 40 nM; triangles, 80 nM. 
Incubations were at 37 °C for 60 minutes. 

15 For the double reciprocal plot of AdoHcy product inhibition with varying AdoMet 
concentrations (Figure 16), reactions contained 20 nM DCMTase and 40 pM 
poly(dI-dC:dI-dC) in 100 mM Tris pH 8.0, 10 mM EDTA, 10 mM DTT, 200 ng/mL 
BSA. The AdoMet concentrations were 0.50, 1.0, 2.0, 4.0 and 8.0 ^iM. The AdoHcy 
concentrations were: squares, 0; diamonds, 0.75 ^iM; circles, 1.5 \iM; triangles, 3.0 

20 ixM; notched squares, 6.0 )xM. Incubations were at 37 °C for 60 minutes. 

For the double reciprocal plot of AdoHcy product inhibition with varying 
poly(dI-dC:dI dC) concentrations (Figures 17A-C), reactions contained 20 nM 
DCMTase in 100 mM Tris pH 8.0, 10 mM EDTA, 10 mM DTT, 200 \ig/mL BSA. 

25 Incubations were at 37 °C for 60 minutes. The poly(drdC:dI dC) concentrations were 
2.5, 5.0, 10, and 20 pM. The AdoHcy concentrations were: squares, 0; diamonds, 15 
fxM; circles, 30 ^.M. For Figure 17 A, AdoMet was held constant at 1.2 ^iM. For Figure 
17B, AdoMet was held constant at 8 jxM. Figure 17C shows secondary slope replots 
from another series of experiments in which the AdoMet concentrations were: circles, 

30 6.3 \xM; diamonds, 2.5 \iM; squares 1 \iM. 

For the double reciprocal plot of poly(dId'"C:dId'"C) product inhibition with varying 
AdoMet concentrations (Figure 1 8), reactions contained 20 nM DCMTase and 60 pM 
poly(dI dC:dI dC) in 100 mM Tris pH 8.0, 10 mM EDTA, 10 mM DTT, 200 |xg/mL 
35 BSA. The AdoMet concentrations were 1.0, 2.0, 4.0 and 8.0 ^M. The 

poly(dId'"C:dId'"C) concentrations were: squares, 0; diamonds, 5.0 pM; circles, 10 
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pM; triangles, 20 pM. Incubations were at 37 °C for 60 minutes. Experimental data 
are shown scattered around lines derived from a fit to equation 5 for noncompetitive 
inhibition. 

5 For the double reciprocal plot of poly(dId'"C:dId'"C) product inhibition with varying 
poly(dI-dC:dI dC) concentrations (Figures 19A-B), reactions contained 20 nM 
DCMTase and 1.5 AdoMet in 100 mM Tris pH 8.0, 10 mM EDTA, 10 mM DTT, 
200 ng/mL BSA. Incubations were at 37 °C for 60 minutes. The poly(dI dC:dI dC) 
concentrations were 20, 40, 80, 120 and 160 pM. The poly(dId"'C:dId'"C) 
10 concentrations were: squares, 0; triangles, 34 pM; circles, 45 pM; diamonds, 68, 
notched squares, 90 pM. Experimental data are shown scattered around lines. In 
Figure 19A, lines are derived from a fit to equation 4 for competitive inhibition. In 
Figure 19B, lines are derived from a fit to equation 5 for noncompetitive inhibition. 
Both fittings are not acceptable. 

15 

For initial velocity plots of different poly(dIdC:dIdC) lengths (Figure 20), reactions 
contained 20 nM DCMTase and 10 jiM AdoMet in 100 mM Tris pH 8.0, 10 mM 
EDTA, 10 mM DTT, 200 [xg/mL BSA. Incubations were at 37 °C for 60 minutes. The 
poly(dI dC:dI dC) sizes were: circles, 100 base-pairs; diamonds, 500 base-pairs; 
20 triangles, 2000 base-pairs; squares, 5000 base-pairs. The inset provides a zoom in 
along the x— axis toward the origin to show the quality of the data. 

Fragmentation of poly(dI dC:dI dC) 

Sonication was used to break a 5000 base-pair average length poly(dI dC:dI dC) to 
25 lengths of approximately 2000, 1400, 600, 500 and 100 base-pairs using a Branson 
Sonifier 450 with a microbore tip. Lengths were estimated by agarose gel 
electrophoresis using DNA size standards. 

Preparation of poly(dld"C:dI<rC) 

30 Poiy(dI dC:dI dC) was methylated to completion with M..Sss\ (New England Biolabs). 
The methylation reaction was optimized and the apparent IQn'^^'^ was determined to be 
0.40 nM for M.55'5l using 6250 base-pan poly(dI dC:dI-dC). For reaction efficiency 
and sufficient yields, a 500 |aL reaction contained 1.0 nM poly(dI dC:dI-dC). AdoMet 
was added to 1 00 ^iM to provide an excess level of methyl-groups to complete the 

35 reaction. Three 20 unit aliquots of M.&sl were added every 10 hours in MR buffer. 
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After cleaning the DNA by standard organic extraction methods, it was resuspended 
to 10 nM in TE (lOmM Tris pH 8.0, ImM EDTA) and subjected to the metihylation 
reaction using M.^*! and radiolabeled AdoMet. Background, 230 cpm, was detected 
with this preparation at 0.80 nM and a similar control experiment with 0.80 nM 
5 unmethylated poly(dI-dC:dI-dC) generated 37,000 cpm. The methylated DNA, 

poly(dI-d'"C:dI-d"'C), was resistant to digestion by Hhal endonuciease and the control 
DNA was digested to small fragments, as determined by agarose gel electrophoresis. 

Isotope Partitioning Analysis 

10 A pre-steady-state approach was used to determine the catalytic competency of the 
DCMTase: AdoMet complex. The complex was formed at 37 using 20 nM 
DCMTase and tritiated AdoMet at a concentration of 10 \xM. The reaction was 
initiated by adding a mixture of 400 pM poly(dI-dC:dI-dC) and 100 mM unlabeled 
AdoMet. After a one hour incubation at 37 "C, the reactions were treated as stated 

15 above. 

Molecular Partitioning Analysis 

A pre-steady-state approach was used to determine the catalytic competency of the 
DCMTase:DNA complex. Two different sizes of substrate DNA, 1400 and 600 base- 

20 pair poiy(dI-dC:dI-dC), were used to distinguish if the initial complex proceeded in the 
forward direction or dissociated before DCMTase performed chemistry. The complex 
was formed at 37 °C for 1.5 minutes with 5 nM DCMTase and the 1400 base-pair 
poly(dI-dC:dI-dC) at 0.20 nM, then a mixture containing 2.0 jiM tritiated AdoMet 
(neat stock concentration, 13 m-M) plus an excess of the molecular competitor, 600 

25 base-pair poly(dI-dC:dI-dC) at 5.0 nM was added to initiate catalysis. Aliquots were 
removed at 1.5, 3 and 9 minutes followed by centrifugation through a P-6 spin 
column (Bio-Rad) to trap unincorporated label. DNA were separated on an 6% 
polyacrylamide, 8M urea gel run at 400 V for 4.5 hours. Standard methods of 
fluorography were used v^rith LiquiScint (National Diagnostics) as the fluor. The dried 

30 gel was exposed to Fuji XAR film for three months at -70 C. 

Data Analysis 

The Michaelis-Menton equation was used for studies into DNA length contributions 
to catalysis using KaiiedaGraph 2. 1 .2 (Synergy Software). For mechanistic 
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determinations, the nomenclature used is that of Cleland, W.W., 1963a. Biochimi. 
Biophysi. Acta 67:104-137. Ail steady-state data were analyzed using regression 
analyses of the appropriate initial velocity equation, listed below, using the Cleland 
programs (Cleland, W.W., 1979, Statistical analysis of enzyme kinetic data, Methods 
5 in EnzymoL 63:103-138). 

Substrate Inhibition: 

V = VA (1) 
10 Ka + A + A^/Ki 

Sequential Mechanism: 

v= VAB (2) 
15 KiaKb + KaB + KbA + AB 

Ping Pong Mechanism: 

v= VAB (3) 
20 KaB + KbA + AB 

Competitive Inhibition: 

v= VA (4) 
25 Ka(l + Kis) + A 

Noncompetitive Inhibition: 

v= VA (5) 
30 Ka(l +Kis) + A(H-Kii) 

Uncompetitive Inhibition: 



V = VA (6) 
35 Ka+ A(l + Kii) 
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The algorithms perform a non-Unear least squares fit to the entire data set. 
Mechanistic determinations were made by comparison of the sigma values associated 
with the fit to each equation. The standard errors associated with fitted parameters and 
5 graphical analysis of the experimental data scattered around the calculated best fit 
lines were also considered m making an assignment. 

Preparation of poly(dI-crC:dId^C) 

Poly(dI-dC:dI-dC).was methylated to completion with M.&jI (New England Biolabs, 
10 Beverly, Massachusetts). Optimization of the methylation reaction was investigated 
and the apparent K^'*^ was determined to be 0.4 nM for M.^'s.sl. For reaction efficiency 
and sufficient yields, a 500 mL reaction contained 1 nM poly(dI-dC:dI-dC). AdoMet 
was added to 100 |iM to provide an excess level of methyl-groups to complete the 
reaction. Three 20 unit additions of M-fe^I were done every 10 hours in our 
1 5 methylation buffer. Complete methylation of poly(dI-dC:dI-dC) was tested. After 
cleaning the DNA by standard methods, it was resuspended to 10 nM in TE and 
subjected to the methylation reaction using yiSssl and radiolabeled AdoMet. 
Background, 230 cpm, was detected with this preparation at 0.8 nM and a control 
experiment generated 37,000 cpm. The methylated DNA, poly(dI-d"'C:dI-d"'C), was 
resistant to digestion by Hhal endonuclease and the control DNA was digested to 
small fragments, determined by agarose gel electrophoresis. 

Fragmentation of poly(dIdC:dIdC) 

Sonication was used to break a 5000 base-pair average length poly(dIdC:dIdC) to 
sizes of approximately 2000, 500 and 100 base-pairs. The sizes were determined by 
agarose gel electrophoresis and comparison to DNA size standards. 

Results 

The DNA substrate used in the steady-state studies was poly(dI-dC:dI-dC). This 
substrate contains tandem methylation sites in which guanosine has been replaced by 
inosine. Methylation is catalyzed at a higher rate with this substrate than with other 
DNA (Flynn, J., et al. (1996) Biochemistry 35:7308-7315; Pedrali-Noy, G., & 
Weissbach, A., (1986) J. Biol. Chem. 261:1, 7600-77602). Poly(dI-dC:dI-dC) 
provides a uniform sequence and limits the potential complexities found with large 
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cloned sequences that contain many randomly situated CpG dinucleotides, each 
having different flanking sequence contributions to binding and catalysis (Flynn et al, 
1996 supra). 

5 Substrate Inhibition 

Linn and coworkers reported that high concentrations of large DNA resulted in 
DCMTase inhibition, and proposed that multimeric forms of the enzyme are required 
for activity (Hitt, M.M., et al, 1988, J. Biol. Chem. 263:4392-4399). To distinguish 
between this and alternative explanations, a short 30 base-pair DNA substrate was 
tested for inhibition at high concentrations. Figure 1 1 shows the initial velocity results 
for poly(dI-dC:dI-dC), 6250 base-pairs, and GC-box a/b in terms of reduced 
concentrations (S/Kn,^^^). Initial velocity data for both substrates were fit to equation 
1, which is a standard equation for analyzing substrate inhibition. ¥^^^ was 
determined to be 5.5 +/- 0.9 pM and 0.31 +/- 0.13 \iM, and Ki°^^ was 1010 +/- 170 
pM and 43 +/- 22 ^iM for poly(dI-dC:dI-dC) and GC-box a/b, respectively (Table 3). 
In both cases, DNA concentrations greater than 20 times K^^^^^ caused substrate 
inhibition. AdoMet utilization was calculated to be less than 0.5%. Product mhibition 
by ^-adenosyl homocysteine formation is therefore unlikely. AdoMet substrate 
inhibition was not observed at concentrations up to 30 times kJ^*^^^^. The substrate 
inhibition observed implicates a second DNA molecule binding to the enzyme and 
inhibiting catalysis. 

Table 3: Substrate Inhibition Constants^ 

GC-box a^ 43000 +/- 22000 nM 

Poly(dIdC:dIdC) 1 .0 +/- 0.2 nM 

* The constants reported, K; in equation 1, were derived from non-linear regression to 
the appropriate rate equations as described above. The nomenclature is that of 
Cleiand (1963b). 

Initial Velocity Studies with Poly(dLdC:dldC) and AdoMet 
Double reciprocal plots of initial velocity versus the substrate concentrations are 
shown in Figure 12. DNA concentrations near Km were used to avoid non-Michaelis 
behavior (see substrate inhibition studies above). The transformed data were best fit 
by lines intersecting left of the>^axis using a non-linear regression of equation 2, a 
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standard equation for analyzing the steady-state mechanism. The true MichaeHs 
constants derived were Km"'"''*^ = 36 +/- 5 pM and Km^'io^^" = 1 .4 +/- 0.2 |liM. The 
intersecting patterns rule out a nonsequential mechanism and implicate a sequential 
order of substrate addition in which both DNA and AdoMet add to the enzyme surface 
5 before products are released. However, a prudent assignment of a kinetic mechanism 
requires additional kinetic arguments. 

Dead- End Inhibition with Single- Stranded DNA 

A previous kinetic study showed no detectable enzyme activity with single-stranded 

10 GC-box a and GC-box b (Flynn, J., et al., 1996, Murine DNA cytosine-C5 

methyltransferase: Pre-steady- and steady-state kinetic analyses with regulatory 
DNA sequences, Biochemistry 35:7308-7315). In contrast, the DCMTase binds these 
same oligonucleotide substrates with affinities comparable to those of other DNA (see 
Example 1). For these reasons it was presumed that single-stranded GC-box 

1 5 substrates could act as dead-end inhibitors of the reaction with poly(dLdC:dI-dC). 

Dead-end inhibitors can provide a strong methodology for assessing whether a kinetic 
mechanism is random or ordered. Inhibition of poly(dI-dC:dI-dC) methylation by 
single-stranded GC-box b was studied at 1 5 \\M AdoMet. The data were best fit by 
equation 5, a standard equation for noncompetitive inhibition, and generated the 

20 intersecting double reciprocal pattern shown in Figure 13. The inhibition constants 
were determined to be Kis = 3.6 +/- 1.5 and Kii = 6.8 +/- L2 }iM (Table 4). Kii 
is the inhibition constant associated with an intercept effect and Kis is the inhibition 
constant associated with a slope effect from families of double reciprocal plots. An 
alpha factor, Kij/Kis, of 1 .9 was determined and suggests that the partitioning of this 

25 inhibitor slightly favors addition to the free enzyme over the 
DCMTase:poly(dI-dC:dI dC) intermediate. 
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Table 4: Dead-End Inhibition Constants and Mode of Inhibition^ 

DNA Inhibition Constant (nM) Mode of Inhibition 

GC-box b 3600 +/- 1 500*' Noncompetitive with poly (dIdC:dIdC) 
5 GC-box b 6800+/- 1200" Noncompetitive with poly (dIdC:dIdC) 

GC-box b^^'^ 20 +/- 3^ Uncompetitive with poly (dIdC:dIdC) 

GC-box b^^"^ 25 +/- 1 0* Competitive with Adomet 

*The constants reported were derived from non-linear regression to the 
appropriate rate equations as described above. The nomenclature is that of 
10 Cleland (1963b). 

""The inhibition constant refers to the slope derived Kii in equation 5. 
''The inhibition constant refers to the intercept derived Kj, in equation 5. 
^The inhibition constant refers to the intercept derived Kii in equation 6. 
®The inhibition constant refers to the slope derived Kii in equation 4. 

15 

The CpG methylated homolog of GC-box b, GC-box b"^^^, was studied for inhibition 
under similar conditions. The data were best fit by the log form of equation 6, a 
standard equation for uncompetitive inhibition. The inhibition constant, Kfi, was 
estimated to be 20 +/- 3 nM. The double reciprocal transformation is shown in Figure 
20 14. Remarkably, a single 5-"'C substitution appears responsible for a 200-fold lower 
inhibition constant and a change in the mode of inhibition. The uncompetitive nature 
of inhibition suggests that GC-box b"^^ and poly(dI-dC:dI-dC) bind to distinct sites 
on the DCMTase surface and that poly(dI-dC:dI-dC) binds prior to GC-box b^"^"^. 

25 Another characterization of the potent inhibition observed with GC-box b^^^ was 
obtained by varying AdoMet and GC-box b^^^ concentrations using a constant 50 
pM poly(dI-dC:dI-dC). The data from initial velocities were best fit to equation 4, 
which is a standard equation for competitive inhibition. The estimated inhibition 
constant was Kjs = 25 +/- 10 nM. The intersection of the fit lines on the 1 /velocity 

30 axis in Figure 1 5 suggests that GC-box b and AdoMet bind competitively to the 
same poly(dldC:dI-dC)-bound form of the enzyme. 

The two inhibition constants determined for GC-box b^^^ are in good agreement at 
about 20 nM. The patterns observed provide strong evidence for an ordered Bi-Bi 
35 kinetic mechanism with substrate DNA binding to the enzyme first and AdoMet 
binding second, followed by the release of products (Spector & Cleland, 1981, 
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Meanings of Ki for conventional and alternative-substrate inhibitors, Bio. Pharm. 
30:1-7). In the absence of poly(dI-dC:dI-dC), GC-box b^^"^ bound free en2yme with a 
120-fold lower affinity (see Example 1). 



5 Product Inhibition Studies 

Product inhibition studies were pursued to further identify the steady-state kinetic 
mechanism (Table 5). The DCMTase reaction product AdoHcy was a competitive 
inhibitor of AdoMet. The competitive nature of AdoHcy with respect to AdoMet 
binding, Kjs = 1 .4 +/- 0.2 piM, suggests that AdoMet and AdoHcy bind to the same 
10 form of the enzyme (Figure 16) or that the kinetic mechanism is Theorell-Chance. 
The Theorell-Chance kinetic mechanism is a simplification of the Ordered B-Bi 
scheme. 



Table 5: Product Inhibition of Murine DNA Cytosine-C^ Methyltransferase/ 

Product Varied Fixed Type of Inhibition 

Substrate Substrate Inhibition Constant'' 



rc IC AdoMet NC nd^ 

rc AdoMet IC NC Kis 5.3 +/- 2.1 pM 

Kii 30 +/- 12 pM 
AdoHcy IC AdoMet NC/UC Nd'' 

AdoHcy AdoMet IC C Kis 1 .4+/-0.2 ^M 



25 ^ rc, fully methylated poly(dIdC:dIdC); IC, poly(dIdC:dIdC); AdoHcy, S-adenosyl 

homocysteine; AdoMet, S-adenosyl methionine; C, competitive; NC, noncompetitive; 

UC, imcompetitive inhibition. 
Kis refers to the inhibition constant derived from a slope affect. Kii refers to the 

inhibition constant derived from an intercept affect. 
30 " nd, not determined. The determination of an inhibition constant may be complicated 

by binding to a second nucleic acid binding site on the DCMTase. 

^ nd, not detemined. The inhibition constants are dependent on the fixed AdoMet 

concentration and inhibition is not overcome by saturating AdoMet concentrations. 

Instead, the inhibition profiles are noncompetitive-like at low AdoMet and 
35 uncompetitive-like at high AdoMet concentrations. 
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A distinctive inhibition profile is revealed v^^ith varying AdoHcy and poly{dIdC:dIdC). 
Two families of plots were obtained with AdoMet at different constant levels, 1 .2 |liM 
and 8 ^iM (Figures 17A and 17B, respectively). Increasing the AdoMet concentration 
had the gradual effect of changing the plots from a noncompetitive-like pattern at 
5 concentrations near K^'^'*"'^^' to an uncompetitive pattern at higher AdoMet 

concentrations. There is a decrease in scale of the j^axis in Figure 17B compared to 
17A and a lack of a significant change of the data collected at high poly(dI-dC:dI-dC) 
concentrations, the points closest to the >^-axis. The data at low AdoMet 
concentrations fitted slightly less well to an uncompetitive model, K^- = 2.0 +/- 0.6 
fxM, than to a noncompetitive model that produced the constants K^^ = 63 +/- 71 |j,M 
and K,i = 2.5 +/- 1.0 p,M. The Kj'^''""'^^ was independently determined in Figure 15 to 
be 1 .4 ^iM. The slope andj^-intercept replots from each AdoHcy versus 
poly(dI-dC:dI-dC) series were all linear. Another study confirmed these results and 
showed a gradual effect of going from a noncompetitive to an imcompetitive model 
using three AdoMet concenfrations; 1, 2.5, and 6.3 nM. This analysis demonsfrates 
that the slope contribution to AdoHcy inhibition is minimal at low AdoMet 
concentrations, inhibition cannot be overcome by high AdoMet concenfrations, and 
AdoHcy binds to a different enzyme form than poly(dI-dC:dI-dC). This is sfrong 
evidence for an Ordered Bi Bi mechanism in which initial DNA binding is followed 
by AdoMet binding and that the following reaction step is irreversible. 

DCMTase:poly(dI-dC:dI-dC) + AdoMet = DCMTase:poly(dI-dC:dI-dC):AdoMet 

Also, the last product to leave the enzyme cannot be AdoHcy if poly(dI-dC:dI-dC) is 
the first substrate to bind DCMTase. Uncompetitive inhibition with AdoHcy and 
DNA was also observed with M.Hhal, it provided evidence that a 
M.///jf2l:DNA:AdoHcy complex can form and ruled out a catalytically significant 
M.//72«I: AdoHcy complex (Wu & Santi, 1987). 

Product Inhibition with Poly(dI(rC:dId"C) 

Fully methylated poly(dId"C:dId'"C) was prepared and used as a product inhibitor of 
the DCMTase reaction. Poly(dId'"C:dId'"C) was linear noncompetitive with AdoMet 
when poly(dIdC:dIdC) was held constant (Figure 18; Table 3). The estimated 
inhibition constants were Kis = 5.3 +/- 2.1 pM and Kii = 30 +/- 12 pM. The 
noncompetitive pattern with AdoMet supports many different mechanisms including 
one in which DNA binding occurs prior to AdoMet binding. 
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The double reciprocal pattern for methylated DNA product versus DNA substrate 
would be expected to be competitive in a standard ordered Bi-Bi kinetic mechanism 
where DNA adds first and methylated DNA leaves last from the catalytically 
5 competent enzyme surface. The double reciprocal data obtained on five experiments 
appeared to be noncompetitive. However, when subjecting the data to fitting by the 
competitive and the noncompetitive models, graphical analysis showed that fitting in 
both cases was not acceptable (Figure 19). This was true for plots obtained with 
AdoMet concentrations held constant from 1.25 to 12.5 ^M. The sensitivity of 

1 0 inhibition was notably abrupt, as poly(dI(rC:dId"'C) had little effect at 10 pM and 
completely inhibited the reaction at 100 pM. The results from one experiment are 
shown in Figures 23A-C with idealized lines intersecting left of they-axis. 
Secondary slope andjr-intercept replots (Figures 23B and 23 C) were obtained and 
both were parabolic concave upward. This explains the difficulty in fitting the simple 

15 model and is indicative of poly(dI-d"'C:dI-d™C) binding at two points in the catalytic 
cycle. Fiirthermore, it is evidence that a DCMTase:DNA:DNA complex can be 
formed. Additional steady-state kinetic experiments also support the existence of an 
inhibitory DCMTase:DNA:DNA complex. 

20 Isotope Partitioning Analysis with AdoMet 

Isotope partitioning analysis is a powerful strategy used to identify catalytically 
competent enzyme: substrate complexes (Rose, 1980; Reich & Mashhoon, 1991). The 
DCMTase: AdoMet complex formed with 10 \iM radiolabeled AdoMet was not 
competent for catalysis, because a chase including 400 pM poly(dLdC:dI-dC) and 100 

25 |.iM unlabeled AdoMet produced no detectable activity. Substrate inhibition was not 
observed at high AdoMet concentrations. This is typical for an Ordered Bi Bi 
mechanism when studying the second substrate by isotope partitioning, because the 
DCMTase:AdoMet complex must dissociate before DCMTase can bind 
poly(dI-dC:dI-dC). Under these conditions, the DCMTase:poly(dI-dC:dI-dC) complex 

30 would then bind a diluted specific activity AdoMet and catalysis would not be 
detectable. 



Molecular Partitioning Analysis 



A novel assay was developed to test the competency of the initial 
35 DCMTase:poly(dI-dC:dI-dC) complex. This complex was formed with one DNA 
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length, 1400 base-pairs, and then challenged with an excess of smaller, 600 base-pair 
DNA combined with AdoMet. The initial DCMTase:poiy(dI-dC:dI-dC) complex was 
observed to be competent for catalysis, because tritium was incorporated into the 
larger DNA. A control experiment allowed both DNA lengths to compete for 
5 DCMTase binding before AdoMet was added, because the smaller DNA was at a 
sufficiently higher concentration all of the detectable label was incorporated into it. 
This demonstrates that DNA, under the conditions employed, can bind first in the 
steady-state mechanism, and limits the assumptions made in other experiments to the 
Ordered Bi Bi mechanism. 

10 

DNA Length Contributions to Catalytic Efficiency 

Sonication was used to break a 5000 base-pair average length poly(dIdC:dIdC) to 
sizes of approximately 2000, 500 and 100 base-pairs. Initial velocity profiles were 
obtained for each size (Figure 20), and the kinetic terms are compared in Table 6. A 

15 14-fold increase in Km°^^ was observed as the length decreased 5000 to 1 00 base- 
pairs, but kcat only dropped by one-third. The hyperbolic trend in specificity constants, 
kcat/Kni^^'^ (Figure 21), suggests a half maximal length of 1200 base-pairs and that 
lengths greater than 2000 base-pairs provide little advantage to catalytic specificity. 
On the contrary, DNA lengths of 500 base-pairs and smaller show a very sharp 

20 decrease in specificity. DNA length is thereby critical to maximal performance of 
DCMTase. 



Table 6: Poly (dIdC:dIdC) length and Catalytic Efficiency'' 



Length Vmax (fmol hr~') kcat (hr"') Km (pM) Kcat/Kin(hr~' M"' x lO"*) 

5000 12500+/- 1100 31.2 140+/- 30 22.2 

2000 9950+/- 390 24.9 125+/- 17 19.9 

500 9120+/- 290 22.8 300+/- 30 7.65 

100 8600+/- 210 21.5 1890+/- 150 1.13 

^Constants were determined from initial velocity analysis using the Michaelis-Menton 
equation. 



35 



Reciprocal plots of both substrates, AdoMet and 100 base-pair poly(dIdC:dIdC), were 
generated. The patterns observed were much like that shown in Figure 12 with 6250 
base-pair poly(dIdC:dIdC). Although large effects in the kinetic terms were observed 
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with decreasing poly(dIdC:dIdC) length, the mechanism of catalysis does not appear 
to be affected. 

Discussion 

5 The data presented herein clarify some of the basic aspects of how cytosine C-5 

methylation is catalyzed and perhaps controlled in eukaryotes. The order of substrate 
binding appears to be DNA followed by AdoMet and the order of product release 
appears to be AdoHcy followed by methylated DNA. Three kinetic methodologies 
support our assignments: initial velocity studies varying both substrates, dead-end 
10 inhibition, and product inhibition. DNA substrate inhibition was common to both 

small, single CpG containing DNA and large, multi-site DNA. A second nucleic acid 
binding region on the DCMTase, distinct from the active site, is implicated from both 
the substrate inhibition and the dead-end inhibition studies. 

15 DCMTase Multimerization and Substrate Inhibition 

Several of the results bear directly on the previously proposed formation of reversible, 
multimeric complexes (Reale, A., et al., 1995, DNA binding and methyl transfer 
catalysed by mouse DNA methylfransferase, Biochem. J. 312:855-861) and the 
inhibition of DCMTase activity at high DNA concentrations (Hitt, M.M., et al., 1988, 

20 De novo and maintenance DNA methylation by a mouse plastytoma cell DNA 
methyltransferase, J. Biol. Chem. 263:4392-4399). An understanding of the 
fimctional form(s) of the DCMTase is essential for fiiture structure-fimction analysis, 
and the mechanism of DNA-mediated inhibition may be important for in vivo 
regulation of the enzyme. The DCMTase in the absence of either DNA or AdoMet 

25 exists as a monomer, as determined by size exclusion chromatography (Xu, G., et al, 
1995, Purification and stabilization of mouse DNA methyltransferase, Biochemi. 
Biophysi. Res. Communi. 207:544-551). Active site titration suggests that the 
enzyme is a fimctional monomer (Flynn, J., et al, 1996, Murine DNA cytosine-C5 
methyltransferase: Pre-steady- and steady-state kinetic analyses with regulatory 

30 DNA sequences, Biochemistry 35:7308-7315). Further support for a 1:1 enzyme to 
DNA catalytic association was provided by gel mobility shift analyses (see Example 
!)• 
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Hitt et al,. 1988, De novo and maintenance DNA methylation by a mouse plastytoma 
cell DNA methyltransferase, J. Biol. Chem. 263:4392-4399) proposed that the 
DCMTase is inhibited at high DNA concentrations by partitioning to a monomeric 
en2yme bound to DNA, and that protein muhimerization results in enzyme activation. 
5 An alternative explanation could be that substrate inhibition occurs with the formation 
of a ternary complex (DCMTase:DNA:DNA). These models were tested with a short 
DNA substrate that is less likely to support protein muitimerization than a long, 
muhi-site substrate. Both substrates clearly showed inhibition at high DNA 
concentrations, and the normalized inhibition profiles appear very similar (Figure 11). 
10 The corresponding substrate inhibition constants, Ki, are 150 to 180 times greater than 
Y^rrF^^ for these very different DNA molecules (Table 3). 

The similar Km/K; ratios suggest that the substrate inhibition is insensitive to the 
number of CpG or Cpl dinucleotides within the DNA. Moreover, the concentration 

15 dependencies, particularly with the small DNA, show that the inhibition occurs via an 
intermolecular process. The results also suggest that the DCMTase has a second 
DNA-binding site with lower affinity for these substrates than the site involved in 
catalysis. The formation of an inhibitory, ternary complex that includes two DNA 
molecules is further supported by our inhibition studies with single-stranded DNA 

20 (see below) and the existence of DNA-binding peptide motifs residing in the non- 
catalytic amino-terminal domain of the enzyme (Bestor, T.H., 1992, Activation of the 
mammalian DNA methyltransferase by cleavage of a Zn binding regulatory domain, 
EMBO 1 1 :261 1-2617; Chuang, L.S., et al., 1996, Characterisation of independent 
DNA and multiple Zn-binding domains at the N terminus of human DNA-(cytosine- 

25 5_ methyltransferase: modulating the property of a DNA-binding domain by 

contiguous Zn-binding motifs, Chia, J., and Li, B.F.L., J. Mol. Biol. 257:935-948). 
Example 1 shows that DNA concentrations ten times higher than Km produce a second 
less mobile band that is consistent with a second DNA binding event. 

30 Kinetic Analysis of DNA and AdoMet Binding 

Knowledge of the order of substrate binding and product dissociation is of critical 
importance to imderstanding an enzyme mechanism and the mechanisms of particular 
inhibitors. A first step in the kinetic characterization for DCMTase is shown in Figure 
12. Several observations suggest that DNA binds first. The dead-end inhibition 
35 observed in Figures 14 and 1 5 with GC-box b^^^ implicates DNA (substrate) binding 
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prior to the inhibitor (ssDNA). These inhibition patterns are inconsistent with both a 
random mechanism and an ordered addition in which AdoMet must bind prior to 
DNA (Cleland, W.W., 1963b, The kinetics of enzyme-catalyzed reactions with two or 
more substrates or products II. Inhibition: Nomenclature and theory, Biochimi. 
5 Biophysi. Acta 67:173-187). The gel shift experiments (Example 1) clearly show that 
DCMTase can bind DNA in the absence of AdoMet. Moreover, cofactor addition had 
no detectable effect on the binding affinity. While the catalytic competence of the 
initial binding event is uncertain, the stability of the complex is dependent on DNA 
sequence. 

10 

The product inhibition studies provided both arguments for and against the classic 
ordered Bi-Bi mechanism shown in Figure 22. Two studies were inconsistent with the 
proposed kinetic order: poly(dId'"C:dId"'C) inhibition with varied poly(dIdC:dIdC), 
constant AdoMet (Figure 19) and AdoHcy inhibition v^th varied AdoMet, constant 

15 poly(dIdC:dIdC) (Figure 16). In the first case, it is proposed that the complexities 

involved with a second DNA binding site have complicated the classic model in ways 
that are difficult to assess from just these studies. In the second case, AdoHcy 
exhibited competitive inhibition, but noncompetitive is expected. It may be that 
AdoHcy is so similar to AdoMet, in that they differ by a methyl group, that the 

20 product inhibition does not behave classically. The Theorell-Chance mechanism 
could also explain this result. Poly(dId"'C:dId'^C) inhibition with varied AdoMet, 
constant poly(dIdC:dIdC) (Figure 18) was consistent with the mechanism proposed. 
Also, it must be considered that the above three product inhibition studies are 
consistent with many different mechanisms (Segel, 1975, Enzyme kinetics behavior 

25 and analysi sof rapid equilibrium an dsteady-state enzyme systems, John Wiley, New 
York, pg 653). The fourth product inhibition study, AdoHcy inhibition with varied 
poly(dIdC:dIdC) and constant AdoMet, appeared somewhat complicated (Figiire 
17A-C). Not only is the result consistent with the proposed mechanism, it is imiquely 
characteristic of it (Segel, 1975, supra). 

30 

An overwhelming amount of the data presented herein support a kinetic order as 
follows: DNA binds, then AdoMet binds and catalysis occurs, AdoHcy leaves 
followed by methylated DNA (Figure 22). This proposed mechanism is similar to that 
described by Wu & Santi (1987, Kinetic an dcatalytic mechanism of Hhal 
35 mehtyltransferase, J. Biol. Chem. 262:4778-4786) for the bacterial DCMTase, 
M.Hhal. We suggest that the intersecting double reciprocal plots for a rapid 
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equilibrium mechanism observed with M.Hhal and our observation of double 
reciprocal plots that intersect far from the jr-axis w^ith the murine DCMTase may be 
reconciled by differences in the lifetimes and partitioning of both the enzyme:DNA 
and en2yme:DNA:AdoMet intermediates. 

5 

DN A length contributes to catalytic efficiency 

The investigations into poly(dIdC:dIdC) length produced some interesting fmdmgs. 
The apparent Km systematically increased 14-foid when decreasing the length from 
5000 to 1 00 base-pairs. On the contrary. Table 6 shows that kcat only decreases by 
10 one-third. This suggests that assembly of the competent enzyme :DNA:AdoMet 
complex is difficult and longer DNA promotes catalysis better than small DNA. 
However, once the complex is formed catalysis can proceed about as well with 1 00 or 
5000 base-pair poly(dIdC:dIdC). 

15 Facilitated diffusion of DNA binding proteins and enzymes is a well characterized 
phenomenon (Surby & Reich, 1996a, Contribution of facilitated diffusion and 
processive catalysis to enzyme efficiency: implications for the EcoRI restriction- 
modification system. Biochemistry 35:2201-2208, Surby & Reich 1996b, Facilitated 
diffusion of the EcoRI DNA methyltransferase is described by a novel mechanism, 

20 Biochemistry 3 5 :2209-22 1 7). It appears fi-om these studies that DCMTase also uses 
facilitated diffusion to seek and stabilize the catalytic complex. The specificity 
constant determined of 6.1 xlO^ sec"' M~' is within an order of magnitude of the 
diffusion controlled limit and because this enzyme is unusually slow, kcat under 30 hr " 
\ it is expected that facilitated diffusion contributes largely to catalysis. Processivity 

25 has not been addressed in our studies, however, the kinetic mechanism proposed is 
that expected for a processive enzyme. 

Identification of a potent, reversible inhibitor 

The finding that single-stranded GC-box a and GC-box b bind with reasonable 
30 affinity (Example 1) was somewhat surprising given our inability to detect a 

significant methyl transfer activity with these sequences. The DCMTase is capable of 
modifying other ssDNA (Flynn, J., et al., 1996, Murine DNA cytosine-C5 
methyltransferase: Pre-steady- and steady-state kinetic analyses with regulatory 
DNA sequences. Biochemistry 35:7308-7315). When using poly(dI-dC:dI-dC) as the 
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substrate, GC-box b and GC-box b"^^^ showed noncompetitive and uncompetitive 
inhibition patterns, respectively (Figures 13 and 14). Both patterns require that the 
inhibitors and poly(dI-dC:dI-dC) bind to distinct sites on the enzyme surface. An 
uncompetitive pattern for GC-box b^'^^ suggests that potent inhibition is through an 
5 allosteric site and, in conjunction with the competitive inhibition with AdoMet, 

strongly implies inhibitor binding to the DCMTase:poly(dI-dC:dI-dC) complex in an 
ordered Bi-Bi mechanism (Spector & Cleland, 1981, Meanings of Ki for conventional 
and alternative-substrate inhibitors, Biochem. Pharm. 30:107). The noncompetitive 
inhibition pattern observed with GC-box b may result through weaker binding at the 

1 0 proposed allosteric site as well as binding at the active site. It is further speculated that 
the allosteric site is the same site where substrate inhibition originates. Because GC- 
box b and GC-box b'^^^ differ only in the methylation state of the single CpG, the 
200-fold increased inhibition by GC-box b^^^ is derived from the presence of the 
methyl group. Whether potent inhibition is caused by the methyl group itself or by 

15 greater DNA structural differences is not known. 

Knowing that the DCMTase proceeds through the catalytic cycle in an ordered Bi-Bi 
mechanism allows for the determination of Ki, the dissociation constant for GC-box 
b"^"^ from the DCMTase:poly(dI-dC:dI-dC) complex (Spector & Cleland, 1981, 
20 Meanings of Ki for conventional and alternative-substrate inhibitors, Biochem. 
Pharm. 30: 1-7). Kjj and Kis are conditional and can vary, thus Ki is the proper 
comparative. It is related to Kii by this relation: Kii = Kj (I + [AdoMetl/Km^**"^^'). 
Solving for Kj using the experimental data from Figure 14 it is found that Ki = 2.5 
nM, a value about 10-fold lower than catalytic inhibition constant Ki. 

25 

Conclusion 

Regulation of DNA replication and transcriptional activation by single-stranded DNA 
is known to occur (Takai, T., et al., 1994, Molecular cloning of MSSP-2, a c-myc 
gene single-strand binding protein: characterization of binding specificity and DNA 

30 replication activity. Nucleic Acids Res. 22:55776-558 1 ; Rajavashisth, T.B., et al., 

1989, Identification of a zinc finger protein tha tbinds to the sterol regulatory element, 
Science 245:640-643; Tomonaga, T., & Levens, D., 1996, Activating transcription 
from single stranded DNA, Proc. Natl. Acad. Sci. USA 93:5830-5835). Nucleic acid 
regulation of DCMTase activity has previously been demonstrated. However, the 

35 requirement for micromolar concenfrations of the polynucleic acids studied by Bolden 
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et al. (1984, DNA methylation. Inhibition ofde novo and maintenance methylation in 
vitro by RNA an dsynthetic polynucleotides, J. Biol. Chem 259:12437-12443) to 
inhibit DCMTase implicates poor binding to the same site suggested in our studies 
with GC-box b'^^^, or direct binding at the active site as competitive inhibitors. A 
5 stimulatory, m-regulation by methylated CpG sites was reported to occur within 
single-stranded DNA using crude extracts (Christman, J.K., et al, 1995, 5-Methyl- 
2'-deoxycytidine in single-stranded DNA can act in cis to signal de novo DNA 
methylation, Proc. Natl. Acad. Sci. USA 92:7347-7351). While the mechanisms of 
regulation remain obscure in these cases, it is clear that they are distinct from the 

10 inhibition described herein. As previously stated, synthetic peptides mimicking 
portions of the DCMTase amino-terminus have been shown to gel mobility shift 
double-stranded DNA (Chuang, L.S., et al., 1996, Characterisation of independent 
DNA and multiple Zn-binding domains at the N terminus of human DNA-(cytosine- 
5) methyltransferase: modulating the property of a DNA-binding domain by 

15 contiguous Zn-binding motifs, Chia, J., and Li, B.F.L, J. Mol. Biol. 257:935-948). 
Although single-stranded DNA was apparently not studied, it would be interesting to 
systematically test these polypeptides for single-stranded DNA binding with and 
without methylated CpG dinucleotides. 

20 The major finding of this Example concerns the identification of a second nucleic acid 
binding site that modulates the activity of DCMTase. Both the substrate inhibition 
studies and the dead-end inhibition studies with CD-box b*^^ provide strong 
evidence for the existence of an allosteric site on the DCMTase surface. The kinetic 
studies demarcate the "allosteric" site, which is necessarily different from the "active" 

25 site where catalysis occurs. The novelty of these findings are drawn from the 

mechanistic insights that define the workings of the en2yme and the modulator in 
ways that have not been accessible to previous investigators. 

GC-box b'^^^ is distinct in form and function from previously described DCMTase 
30 inhibitors. There is a need for DCMTase inhibitors that are not incorporated into DNA 
and that are mechanistically unlike 5— azadeoxycytidine (Belinsky, S.A., et al., 1996, 
Increased cytosine DNA-methyltransferase activity is target-cell-specific and an 
early event in lung cancer, Proc. Natl. Acad. Sci. USA 93:4045-4050; Szyf, M., 1996, 
The DNA methylation machinery as a target for anticancer therapy, Pharmacol. Ther. 
35 70: 1-37; Jones, P.A., 1 996, DNA methylation errors and cancer, Cancer Res. 

56:2463-2467). GC-box b'^^^ clearly interacts with a region of the enzyme that is 
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distinct from the active site and is highly sensitive to the presence of 5-methyl 
cytosine. The modulator described herein is a reversible antagonist of DCMTase 
function that provides a new class of therapeutics for treating developmental disorders 
such as cancer. 

5 

Example 3: Anti-proliferative Effects of DCMTase Inhibitors on Cells 

It has been observed on several occasions that incubating mouse erythroleukemia 
(MEL) cells with GC-box b'^^'^, GC-box p and GC-box p^"^ slows down the growth 
rate. The effect was shown to be concentration dependent. Inhibitor induced anti- 

10 proliferation was greatest at a concentration of 10 micromolar, 1 micromolar produced 
a moderate effect and O.I micromolar concentrations produced only a small difference 
in growth rate in comparison to untreated cells. As observed under the light 
microscope, concentrations of GC-box p"^^^ and GC-box p exceeding 2.5 
micromolar induced MEL cells to produce small refractory particles of unknown 

15 content. GC-box b^^ also was observed to produce these particles at similar 

concentrations. The decrease in growth rate became more apparent after six days and 
three passages of the cells to fresh media containing the same inhibitor concentrations. 
Also, larger cells began to populate the culture after three days in a similar 
concentration-dependent manner. These large cells contained multiple nuclei and 

20 increased in number as length of incubation increased. After five days of incubation 
with 10 micromolar GC-box p'^^ the large multi-nucleate cells were observed to 
occur at about one in fifty regularly sized cells. Large multi— nucleate cells were also 
induced using the DCMTase anti— sense phosphorothioate used by Ramachandani et 
al., 1997 at a concentration of 10 micromolar. 
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SEQUENCE LISTING 

U) GENERAL INFORMATION 

<i) APPLICANT: Reich, Norbert O. 

Flynn, James 

(ii) TITLE OF THE INVENTION: MODULATORS OF DNA CYTOSINE-5 

METHYLTRANSFERASE AND METHODS FOR USE THEREOF 

(iii) NUMBER OF SEQUENCES: 110 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Merchant, Gould, Smith, Edell, Welter & Schmidt 

(B) STREET: 11150 Santa Monica Boulevard, Suite 400 

(C) CITY: Los Angeles 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 90025 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 12-JUN-1998 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/057,411 
<B) FILING DATE: 29-AUG-1997 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Canady, Karen S 

(B) REGISTRATION NUMBER: 39,927 

(C) REFERENCE /DOCKET NUMBER: 30794. 30WO01 

(ix> TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 310 445-1140 

(B) TELEFAX: 310 445-9031 

(C) TELEX: 

<2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
GGGGGGGRRK KGCGKGGKGK KGKKGG 2 6 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
KGGRKKRDDD KRCGKRRDKK KKKKKG 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GGGGCGGGGC 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CCAACGTTGG 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CCAAGATTGG 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHAFIACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CGATCGATCG 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 
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CCAACGTTGG 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CCGTACGTAC GG 

{2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGGAATTCAA GGGGCGGGGC AAGGATCCAG 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTGGATCCTT GCCCCGCCCC TTGAATTCCC 30 
(2) INFOR^dATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGGAATTCAA ATGACGTCAA AAGGATCCAG 30 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CTGGATCCTT TTGACGTCAT TTGAATTCCC 

(2) INFORMATION FOR SEQ ID NO: 13: 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCTACCCACC CTGGATCCTT GCCCCGCCCC TTGAATTCCC AACCCTCCAC 
(2) INFORmTION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ATCCTTGCCC CGCCCCTTGA AT 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TTGCCCCGCC CCTT 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 : 



(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GTGGGATGGG AACGAGTTGA GGAGGG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AGTGGTATGT ATCGATTATA GTTGGG 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID N0:20: 
GGAGGAAGTT TACGTATGGT ATGGGG 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(XX) SEQUENCE DESCRIPTION: SEQ ID N0:21: 
TGGGAGGGGA TTCGAGGTGA GAGTTG 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
ATAAAGTATT AGCGTAAGAG ATGAAG 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TGGAGGAGTT TACGGTGTAA TTGTTT 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRAW DE ONES S: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
GGAGTAGGTA GACGTTAAGT ATGATG 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GTGGGAAGGG GACGAATTTG AAGGTG 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
TGGTAATGTA TTCGTAAATG TAAGGG 

(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TAATAGGGGA GACGTAAATG TAAGGG 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
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GAGTGTAGAA GTCGTAATAG ATTTAG 

{2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
TGAGTAGGAA AGCGAAGAGG TGTTGG 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 
TAGGTATTGG GGCGGAAGGT GGGTGG 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GGGGGTATAA TACGGTGTTG GTAGGG 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GGGTTGGGGT TTCGTGTGGG GGGTGT 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
TGTGGGTATG GGCGGTGATA GTGAAG 

(2) INFORMATION FOR SEQ ID NO: 34: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GGATGATGGG GTCGAGAGTG GTGGTG 

(2) INFORMATION FOR SEQ ID ^30:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
TAGTGGGTGG AGCGAGTGGT GGTTGG 

(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
AGGGTGGGTG GGCGGAGTTG TTGTTG 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GTGAGGAGGG AGCGGGAATG GGGGTG 2 6 

(2) INFORMATION FOR SEQ ID NO: 38: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GGGGGTGGGG AGCGGAGGGG GGTGAG 26 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

TGTTGGAGGG GGCGAAGGTG GTTTTG 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i> SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
GGGGGGGGGG GGCGAGGGGT AGATGG 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GGGGGAGGGG TTCGGTGATA GGTAGG 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY; linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
GGGGGGGGGG TACGTGGGAT GGTATG 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
GTGTAGGGAG TGCGAGGGGG TGTAAA 2 6 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
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GGGGGGGGGT AGCGGTTAGA TGGTGG 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
{B) TYPE: nucleic acid 

(C) STRAKDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
GGGGTGAGGG GGCGGGGGTT AGTGGG 

(2) INFORMATION FOR SEQ ID NO: 4 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES3: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
GAGGGGGGGT TGCGTAGGGG GGTGGG 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
TGTGGAGGTG GGCGGGAAAG GTGATG 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
GGGGGGATGG GACGGATGGG GGGGGG 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
GGGGGTGGGG TGCGAGAGAG TTGGGG 

(2) INFORMATION FOR SEQ ID NO: 50: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

GAGGGGTGGA GGCGGAGGTG GGTTGG 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
(B} TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
GGGGGGGGGG GGCGGATAAG GGTGTG 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
TGGGGGGGGG GGCGGGGGGA GTTTGA 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GGGGGGGAGG GGCGGATAGT TGTGTG 

{2) INFORMATION FOR SEQ ID NO:54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
GGGTGGGGTG GGCGGTGGGG TGTGGG 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
GAGGGGGGGG AGCGGAGGGG GTTGGG 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 
GGGGGGGAAG GGCGTGGGGT TGGGTG 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 
GGGAGGGGGG GCGATGGGGT GGTGG 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
GGGTGGGGGT GGCGTTGTGG GTGGGG 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
GGGAGGGGGT GGCGGTGGGT ATGTGG 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

GGGGAGGGTG GGCGGGTATG GAGTGG 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 26 base pairs 
<B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
GGGGGGGGAG TGCGTTGATG GGTGTG 

(2) INFORMATION FOR SEQ ID NO: 62: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
GGGGGGGTGG ATCGTGGGGG GAGGGG 

(2) INFORMATION FOR SEQ ID NO: 63: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

GGGGTAGGGT GGCGGGGGGG GTATGG 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
GGGATGGGGG TGCGGGGTAT GGGGGG 2 6 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



GGGAGGGGGT AGCGGGAGTG TGTGTG 
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(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
GGGGGTAAGG GGCGTAAGAA TGGGGG 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
GGGGGGGTGG TTCGGTAATG GGGGGT 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 68 : 
GGTGGGAGAG GGCGTGGTGT AGGTAG 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
GGGGGGGGTG TACGAGGTTT GTGTGG 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: 
TGGTGGAGGG GGCGAAGAAG TGTGTG 

(2) INFORMATION FOR SEQ ID NO: 71: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 6 base pairs 



wo 99/12027 



PCT/US98/12351 



-74- 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
GGGGGTGGGA TGCGGAATAA GGATGG 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
TGAGGGGGAG GGCGAATAGA TGGTGG 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
GGGGGGAGTA AGCGGGGGTG TGGTGG 

(2) INFORMATION FOR SEQ ID NO:74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
TGAAGGGGGG TGCGGGGTGT GGGGG 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 
GTGGTGATGG GGCGGGGTGG TAGTGG 

(2) INFORMATION FOR SEQ ID NO: 7 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
TGGAGGGGTA GGCGTGGGGT GATGGG 

<2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
GGTAGGGAGT GGCGGGTGGT GATGGG 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
GGGTGTAGAG GGCGGGAGTA GAGGGG 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
GGGTGGGTTT GGCGTAATTG TGTGGG 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
GGGTGTGTTG GGCGTGGGGT ATGTAG 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
TGGGGAGAAT GGCGGGGGGT GGTGGG 
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(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
TATGGTGGGA GGCGGGGGGG GGTTGG 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
TGGGGAAAGA GGCGTGAGTG GGGGGG 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
TGTAGGGGAG GACGGGGGAT GGGGTG 

(2) INFORMATION FOR SEQ ID NO:85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
GGGTGGGTAA TGCGTAGGGT GGGGGG 

(2) INFORMATION FOR SEQ ID NO: 86: 

■ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
GTGTGGGTAA GGCGGTATGG GGGTGG 

(2) INFORMATION FOR SEQ ID NO: 87: 
(i) SEQUENCE CHARACTERISTICS: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
TGGAGGGTGT TGCGGTGAGG TGGTGG 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
GGTGGTGGTG ATCGGGGTTG TGATGG 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
GGGGGTAAAG TGCGGGTGGT TGATGG 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

GTGGAGGTGT TGCGTAGTGT GGGAGG 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

GTGGGGAATG GTCGGTTATG GTGGGG 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
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GGGATGTGGT AGCGGGGGTG TGTTAG 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
GGGGTAGGAG TTCGTAGGGG TGTGTT 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
GAGGTGGTGG ATCGGGATGA TGGATT 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
TGGGGGGAAA TACGGGGAGG GTGGTA 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GGAGTAGGGT TACGTGGTGG TAATGG 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
GAGGAGTAAA GGCGTGTGTT GTGGTG 

(2) INFORMATION FOR SEQ ID NO: 98: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98 : 
TGGATGAGAG TGCGTGTATG ATAAGG 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

AGGGTTAGTG AACGGGGGGG AGGTGG 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 6 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
GAGAAGGGTA AACGTGGGGG AGGGGA 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
TGGGGGGGGG GGCGGGGGGA GTTTGA 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
GAACAATGGG GGCGCTGGGG GGGGGGGCGG GGGGGCTTTA GCTATGTCAG AATTCA 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
GGGATGGGGG TGCGGGGTAT GGGGGG 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
GGGGAACAGC GAGCACCGAA GGGGGTGCGG GGTATGGGAG GGTCCCCGGG CTTGAGC 
(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
GGTGGTGGTG ATCGGGGTTG TGATGG 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
TGTCCTTCTT GTGGTGGTGG TAGAGGTCGT GGTTGTGATG GTGGCTCGGT GTGTGT 
(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: 
GAGGGGGGGG AGCGGAGGGG GTTGGG 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:108: 
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GAGGGGTGTA GCCGCGAGGG GGCGGAGCGG AGGGGGAGGG CCCTGGTCCC GCCGCC 5 6 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
CCCCACCCAC AACGCCACCC CCACCC 2 6 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: ' 
TCTTTAAATG GTGCGGTCCA CCCCCACCGC CACCCCCACC CCCCACTGGA GCAAGG 56 
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What is claimed is: 



A synthetic oligonucleotide comprising a 05 methylcytosine and which recognizes 
and binds an allosteric site on DNA cytosine methyitransf erase (DCMTase) thereby- 
modulating DCMTase activity associated with the allosteric site. 

The synthetic oligonucleotide of claim 1, wherein the modulating comprises 
inhibition. 

The ^thetic o%onucleotide of daim 1, wherein the modulating comprises 
activation. 

The synthetic oligonucleotide of claim 1, wherein the C-5 methylcytosine is present 
as a 5mCpG dinudeotide. 

The synthetic oligonudeotide of daim 1, wherein the DCMTase is from a mammal, 
bird, fish, amphibian, reptile, insect, plant or fungus. 

The synthetic oligonudeotide of daim 5, wherein the mammal is sdected from the 
group consisting of mouse and human. 

The synthetic oligonudeotide of daim 1 having an inhibition constant of not greater 
than 1000 nM. 

The synthetic oligonudeotide of daim 7 having an inhibition constant of not greater 
tiian 200 nM. 

The synthetic oligonudeotide of daim 8 having an inhibition constant of not greater 
than 20 nM. 

The synthetic oligonudeotide of claim 1 comprising a nudeotide sequence as shown 
in Figure IB and designated GC-box b^ (SEQ ID NO:10), GC-box p^ (SEQ 
ID NO:10), GC-box c^ (SEQ ID NO:13), GC-box d^(SEQ ID NO:14), GO 
box e^ (SEQ ID NO:15), or CRE a^ (SEQ ID NQll). 

A method of inhibiting methjdarion of DNA comprising contacting a DCMTase 
with a synthetic inhibitor molecule so as to form an en2yme/ synthetic inhibitor 
molecule complex in the presence of the DNA, wherein the synthetic inhibitor 
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molecule comprises a C-5 methykytosine which recognizes and binds an allosteric 
site on DCMTase, dierehy inhibitir^ DNA meAykransferase activity. 

12. A method of inhibiting proliferation of cancer cells comprising administering to a 
subject a synthetic inhibitor molecule which recognizes and binds an allosteric site 
on DCMTase thereby resulting m an enzyme/ synthetic inhibitor molecule complex, 
the presence of the complex inhibiting DCMTase-mediated methyladon of DNA, 
thereby inhibiting proliferation of the cancer cells. 

1 3 . The method of claim 12, wherein the cancer cell is from lung, breast, prostate, 
pancreas or colon. 

14. The method of claim 1 1, whereia the synthetic inhibitor molecule is an 
oligonucleotide of any one of claims 1-10. 

15. The method of daim 12 or 13, wherein the subjea is a human. 

16. The method of claim 12 or 13, wherem the subject is an animal. 

17. The method of daim 16, wherein the animal is porcine, piscine, avian, feline, equine, 
bovine, ovine, caprine or canine. 

18. A method of identifying a molecule which recognizes and binds an allosteric site on 
DCMTase comprising: 

(a) contacting a molecule with DCMTase in the presence of DNA and 
AdoMet; 

(b) measuring DCMTase activity, an increase or decrease in DCMTase activity 
being indicative of a modulator of DCMTase; and 

(c) determining whether the modulation of DCMTase activity is via binding an 
allosteric site on DCMTase. 

19. The method of claim 18, wherein the modulator is an inhibitor. 

20. The method of daim 18, wherein DCMTase activity is measured using a steady- 
state assay. 
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2 1 . The method of claim 12, wherein the synthetic inhibitor molecule comprises a C-5 
methjdcytosine. 

22. The method of claim 12, wherein the synthetic inhibitor molecule is an 
oligonucleotide of any one of claims 1-10. 

5 23 . The method of claim 14, wherein the subjea is a human. 

24. The method of claim 14, wherein the subject is an animal. 

25 . The method of daim 24, wherein the animal is porcine, piscine, avian, feline, equine, 
bovine, ovine, caprine or canine. 
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FIG.7a. 

STARTING POPULATION 

GTGGGATGGGAACGAGTTGAGGAGGG 

AGTGGTATGTATCGATTATACGTTGGG 

GGAGGAAGTTTACGTATGGTATGGGG 

TGGGAGGGGATTCGAGGTGAGAGTTG 

ATAAAGTATTAGCGTAAGAGATGAAG 

TGGAGGAGTTTAUjGGTGTAATTGTTT 

GGAGTAGGTAGACGTTAAGTATGATG 

GTGGGAAGGGGACGAATTTGAAGGTG 

TGGTAATGTATTCGTAAATGTAAGGG 

TAATAGGGGAGACGTAAATGTAAGGG 

GAGTGTAGAAGTCGTAATAGATTTAG 

TGAGTAGGAAAGCGAAGAGGTGTTGG 

FIG.7b. GENERATION 1 

TAGGTATTGGGGCGGAAGGTGGGTGG 
GGGGGTATAATACGGTGTTGGTAGGG 
GGGTTGGGGTTTCGTGTGGGGGGTGT 
TGTGGGTATGGGCGGTGATAGTGAAG 
GGATGATGGGGTCGAGAGTGGTGGTG 
TAGTGGGTGGAGCGAGTGGTGGTTGG 
AGGGTGGGTGGGCGGAGTTGTTGTTG 
GTGAGGAGGGAGCGGGAATGGGGGTG 
GGGGGTGGGGAGCGGAGGGGGGTGAG 
TGTTGGAGGGGGCGAAGGTGGTTTTG 

FIG.7C. GENERATIONS 

GGGGGGGGGGGGCGAGGGGTAGATGG 

GGGGGAGGGGTTCGGTGATAGGTAGG 

GGGGGGGGGGTACGTGGGATGGTATG 

GTGTAGGGAGTGCGAGGGGGTGTAAA 

GGGGGGGGGTAGCGGTTAGATGGTGG 

GGGGTGAGGGGGCGGGGGTTAGTGGG 

GAGGGGGGGTTGCGTAGGGGGGTGGG 

TGTGGAGGTGGGCGGGAAAGGTGATG 

GGGGGGATGGGACGGATGGGGGGGGG 

GGGGGTGGGGTGCGAGAGAGTTGGGG 

GAGGGGTGGAGGCGGAGGTGGGTTGG 

GGGGGGGGGGGGCGATAAGGGTGTG 
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FIG.12a. 
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FIG.20. 
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FIG.23a. 
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As a below named inventor I hereby declare that: my residence, post office address and citizenship are as stated below 
next to my name; that 

I verily believe I am the original, first and sole inventor (if only one name is listed below) or a joint inventor (if pltiral 
inventors are named below) of the subject matter which is claimed and for which a patent is sought on the invention 
entitled: 

MODULATORS OF DNA CYTOSINE-5 METHYLTRANSFERASE AND METHODS FOR USE THEREOF 
The specification of which: 

a. n is attached hereto. 

b. ^ was filed on June 12, 1998 as PCT International AppKcation Number PCr/US98/ 12351, which I have reviewed 
and for which I solicit a United States patent. 

I hereby state that I have reviewed and understand the contents of the above- identified specification, mcluding the 
claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose information which is material to the patentability of this application in accordance 
with Tide 37, Code of Federal Regulations, § 1.56 (attached hereto). 

I hereby claim foreign priority benefits under Title 35, United States Code, § 119(a)- (d) or 365(b) of any foreign 
application(s) for patent or inventor's certificate or 365(a) of any PCT international application which designated at least 
one country other than the United States of America, listed below and have also identified below any foreign appKcation 
for patent or inventor's certificate or any PCT application having a filing date before that of the application on the basis 
of which priority is claimed: 



a. ^ no such applications have been fUed. 

b. Q such applications have been fUed as follows: 



FOREIGN APPLICATION(S), IF ANY, CLAIMING PRIORITY UNDER 35 USC % 119 


COUNTRY 


APPLICATION NUMBER 


DATE OF FILING 
(day, month, year) 


DATE OF ISSUE 
(day, month, year) 
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COUNTRY 


APPLICATION NUMBER 


DATE OF FILING 
(day, month, year) 


DATE OF ISSUE 
(day, month, year) 











I hereby claim the benefit under Tide 35, United States Code, § 120 of any United States application(s), or 365(c) of any 
PCT international application(s) designating the United States of America, listed below and, insofar as the subject matter 
of each of the claims of this application is not disclosed m the prior United States or PCT international application in the 
manner provided bythe first paragraph of Title 35, United States Code, § 112, 1 acknowledge the dutyto disclose 
material information as defined in Tide 37, Code of Federal Regulations, § 1.56(a) which occurred between the filing 
date of die prior application and the national or PCT international filing date of this application. 
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U.S. PARENT APPLICATION OR 
PCT PARENT NUMBER 


DATE OF FILING (day, 
month, year) 
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I hereby claim the benefit under Title 35, United States Code § 119(e) of any United States provisional appKcation(s) 
listed below: 



U.S. PROVISIONAL APPLICATION NUMBER 


DATE OF FILING pay, Month, Year) 


60/057,411 


29 August 1997 



I hereby appoint the following attorneys to prosecute this application and to transact all business in the Patent and 
Trademark Office connected herewith: 



George K Gates Registration N o. 33,500 , 

Victor G. Cooper Registration Ng., 39,641 

; Anthony J. Drier Registration No. JJ^SZ 

Karen S.Canady Registration No.i9^27 

William J. Wood Registration No"j2^6 

Jason S. Feldmar Registrauon No. 39ji7_ 



I hereby authorize them to act and rely on instructions from and communicate dlrecdy with the 

person/ assignee/ attorney/firm/ organization who/ which first sends/ sent this case to them and by whom/ which I 

hereby declare that I have consented after fuU disclosure to be represented unless/until I instruct Gates & Cooper to the 

contrary. 

Please direct all correspondence in this case to the firm of Gates & Cooper at the address indicated below: 



^ GAllS_&a3OTR 
In ward Hug hes Center 
670 1 Center Drive W est. SuitglOS^^ 
Los Angeles, CA 90045 ^ 

I hereby declare that all statements made herein of my own knowledge are true and that all statements made on 
information and belief are believed to be true; and further that these statements were made with the knowledge that 
willful false statements and the like so made are punishable by fine or imprisonment, or both, under Section 1001 of 
Tide 18 of the United States Code and that such willful false statements may jeopardize the vaHdity of the application or 
any patent issued thereon. 



(1) 


Full Name 
Of Inventor 


Family Name 

_REICH 


First Given Name 
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State or Foreign Country 
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Date: 
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^Ifita 


State or Foreign Country 

California 93 117 
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§ 1.56 Duty to disclose information material to patentability. 

(a) A patent by its very nature is affected with a public interest. The public interest is best served, and the most effective 
patent examination occurs when, at the time an application is being examined, the Office is aware of and evaluates the 
teachings of aU information material to patentability. Each individual associated with the filir^ and prosecution of a 
patent application has a duty of candor and good faith in dealing with the Office, which includes a duty to disclose to the 
Office all information known to that individual to be material to patentability as defined in this section. The duty to 
disclose information exists with respect to each pending claim until the claim is canceled or withdrawn from 
consideration, or the application becomes abandoned. Information material to the patentability of a claim that is 
canceled or withdrawn from consideration need not be submitted if the information is not material to the patentability 
of any claim remaining under consideration in the appUcation. There is no duty to submit information which is not 
material to the patentability of any existing claim. The duty to disclose all information known to be material to 
patentabiKty is deemed to be satisfied if all information known to be material to patentability of any claim issued in a 
patent was cited by the Office or submitted to the Office in the manner prescribed by §§ 1.97(b)- (d) and 1.98. Ffowever, 
no patent will be granted on an apphcation in connection with which fraud on the Office was practiced or attempted or 
the duty of disclosure was violated through bad faith or intentional misconduct. The Office encourages applicants to 
carefully examine: 

(1) prior art cited in search reports of a fore^n patent office in a counterpart application, and 

(2) the closest information over which individuals associated with the filing or prosecution of a patent 
application believe any pending claim patentably defines, to make sure that any material information contained 
therein is disclosed to the Office. 

(b) Under this section, information is material to patentability when it is not ctimtilative to information already of record 
or being made of record in the application, and 

(1) it establishes, by itself or in combination with other information, a prima facie case of unpatentability of a 
claim; or 



(2) it refutes, or is inconsistent with, a position the appHcant takes in: 

(i) opposing an argument of unpatentability relied on by the Office, or 

(ii) asserting an argy.ment of patentability. 
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A prima facie case of unpatentability is established \\4ien the information compels a conclusion that a claim is 
unpatentable under the preponderance of evidence, burden- of-proof standard, giving each term in the claim its broadest 
reasonable construction consistent with the specification, and before any consideration is given to evidence which may 
be submitted in an attempt to establish a contrary conclusion of patentability. 

(c) Individuals associated with the filing or prosecution of a patent application within the meaning of this section are: 

(1) each inventor named in the application: 

(2) each attorney or agent who prepares or prosecutes the application; and 

(3) every other person who is substantively involved in the preparation or prosecution of the application and 
who is associated with the inventor, with the assignee or with anyone to whom there is an obligation to assign 
the appHcation. 

(d) Individuals other than the attorney, agent or inventor may comply with this section by disclosing information to the 
attorney, agent, or inventor. 
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